Ask HN: Best on device LLM tooling for PDFs?

4•martinald•12mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", then gave me an example of what the data _could_ look like, which was in fact the actual data.

The other major problem is that some PDFs are actually made up of images, and it got very confused by those as well.

Given this is so new, I'm struggling to find any tools which make this easier.

Comments

raymond_goo•12mo ago
Try something like this:

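  # Notebook cell (Colab/Jupyter): lines starting with "!" run shell commands.
  !pip install pytesseract pdf2image pillow
  # pdf2image needs poppler to render PDF pages.
  !apt install -y poppler-utils
  # pytesseract needs the Tesseract binary; uncomment if it is not installed.
  #!apt install -y tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render every page of the PDF to an image at 300 dpi.
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page image and label the text with its page number.
  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)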
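For PDFs that mix real text pages with scanned image pages, a variation on the same idea might be to read the embedded text layer first and only OCR the pages where it comes back (nearly) empty. This is a rough sketch, not a tested tool: it assumes `pypdf` is installed in addition to the packages above, and `page_text`, `extract_pdf` and the `min_chars` threshold of 20 are made-up names and values for illustration.

```python
from typing import Callable


def page_text(embedded: str, ocr: Callable[[], str], min_chars: int = 20) -> str:
    """Return the embedded text if it looks real, otherwise fall back to OCR."""
    cleaned = embedded.strip()
    if len(cleaned) >= min_chars:
        return cleaned
    # Little or no text layer: probably a scanned page, so run OCR.
    return ocr().strip()


def extract_pdf(path: str, dpi: int = 300, min_chars: int = 20) -> str:
    """Extract text page by page, using OCR only where the text layer is missing."""
    # Third-party imports are local so page_text() works without them installed.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    reader = PdfReader(path)
    parts = []
    for page_num, page in enumerate(reader.pages, start=1):

        def ocr(n: int = page_num) -> str:
            # Render only this page; first_page/last_page are 1-based.
            img = convert_from_path(path, dpi=dpi, first_page=n, last_page=n)[0]
            return pytesseract.image_to_string(img)

        text = page_text(page.extract_text() or "", ocr, min_chars)
        parts.append(f"\n--- Page {page_num} ---\n{text}")
    return "".join(parts)
```

Calling `print(extract_pdf('k.pdf'))` should give output in the same shape as the snippet above. The threshold is a guess; pages with only a header or page number in the text layer may need a higher value.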

constantinum•11mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.
