Ask HN: Best on device LLM tooling for PDFs?

4•martinald•5mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools which make this easier.

Comments

raymond_goo•5mo ago
Try something like this:

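  # Notebook cells (Colab/Jupyter): install the Python packages and system tools.
  !pip install pytesseract pdf2image pillow
  !apt install -y poppler-utils
  # pytesseract needs the tesseract binary; uncomment if it isn't already installed.
  #!apt install -y tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render every PDF page to an image (pdf2image uses poppler under the hood).
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR page by page and label each page's text.
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)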
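Since some PDFs have a real text layer and others are just scanned images, one possible refinement is to OCR only the pages that need it. This is a rough sketch, not a tested pipeline: it assumes pypdf is installed alongside pdf2image and pytesseract, and the `needs_ocr` helper and the `MIN_CHARS` threshold are made up for illustration and would need tuning for your documents.

```python
# Sketch: use a page's embedded text layer when it looks usable, otherwise OCR it.
# Assumes pypdf, pdf2image and pytesseract are installed.

MIN_CHARS = 20  # hypothetical threshold: fewer characters suggests a scanned page


def needs_ocr(embedded_text, min_chars=MIN_CHARS):
    """Return True when the text layer is missing or too short to trust."""
    return len((embedded_text or "").strip()) < min_chars


def extract_pdf_text(path, dpi=300):
    """Return one string per page, falling back to OCR for image-only pages."""
    # Imported here so needs_ocr() can be used without these packages.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    results = []
    reader = PdfReader(path)
    for page_num, page in enumerate(reader.pages, start=1):
        text = page.extract_text() or ""
        if needs_ocr(text):
            # Render only this page to an image, then OCR it.
            images = convert_from_path(
                path, dpi=dpi, first_page=page_num, last_page=page_num
            )
            text = pytesseract.image_to_string(images[0])
        results.append(text)
    return results
```

Calling `extract_pdf_text('k.pdf')` should give you a list of page strings that you can paste into a local model as plain text, which sidesteps the model having to do the "OCR" itself.
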
constantinum•5mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.
