frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•7mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•7mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•7mo ago
give https://pg.llmwhisperer.unstract.com/ a try

OpenAI API and ChatGPT are down

9•themanmaran•2h ago•1 comments

Ask HN: Is it time for HN to implement a form of captcha?

69•Rooster61•7h ago•100 comments

I built an AI agent that deploys a PR to production

2•amouehsan•2h ago•0 comments

Ask HN: Where is legacy codebase maintenance headed?

4•AnnKey•4h ago•2 comments

Ask HN: Any Microsoft employees/devs here? What's happening to Microsoft?

100•thehamkercat•2d ago•77 comments

Ask HN: Who wants to be hired? (January 2026)

167•whoishiring•6d ago•397 comments

Ask HN: What will you pick as your database of 2026 Yugabyte, TiDB or SurrealDB?

4•marknefedov•4h ago•1 comments

Ask HN: How do you use 5–10 minute gaps productively?

41•pea•4d ago•54 comments

Ask HN: Who is hiring? (January 2026)

353•whoishiring•6d ago•334 comments

Developing a high level language over Zig

2•ziyaadsaqlain•16h ago•2 comments

Ask HN: How would you decouple from the US?

19•yawa_me_worht•18h ago•7 comments

Implementing NaN Boxing in a Stack-Based VM

4•tracyspacy•1d ago•0 comments

Ask HN: We built an air-gapped document vault with encrypted print and export

3•KevinG777•21h ago•6 comments

RevisionDojo, a YC startup, is running astroturfing campaigns targeting kids?

451•red-polygon•3d ago•86 comments

Ask HN: What's a standard way for apps to request text completion as a service?

5•nvader•3d ago•3 comments

Ask HN: Is anyone aware of a LinkedIn mirror like xcancel.com for X?

11•danielfalbo•1d ago•8 comments

Ask HN: Anyone else seeing porn images in YouTube ad preview images?

4•OhMeadhbh•1d ago•6 comments

Cancelled 2x Cursor Ultra plans, here's why

8•throwawayround•7h ago•6 comments

Git analytics that works across GitHub, GitLab, and Bitbucket

3•akhnid•2d ago•1 comments

Ask HN: How do you do store-and-forward telemetry at the edge?

4•Aydarbek•1d ago•3 comments

Amazon Prime AI overviews can't even get the basics right

43•PyWoody•2d ago•13 comments

Ask HN: Has anyone else been struggling with search lately?

31•areoform•3d ago•19 comments

Ask HN: Reading list for being a better engineer?

44•drekipus•5d ago•16 comments

Ask HN: How do small teams make sure recurring tasks don't slip?

7•batels•2d ago•15 comments

Anyone building software for wearable tech?

16•ssc23•3d ago•15 comments

I made a lofi page for late night work

20•onmyway133•3d ago•8 comments

My Logitech mouse became unusable, Logi Options+ can't validate certificate

12•enescakir•1d ago•10 comments

Ask HN: What did you learn in 2025?

21•kiernanmcgowan•5d ago•8 comments

Tell HN: I'm having the worst career winter of my life

98•mariogintili•6d ago•126 comments

What do people usually do with spare Android phones? Any practical use cases?

18•AndroidShare•4d ago•21 comments