frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•11mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•11mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•11mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: What do you still do manually in 2026 that should be automated?

4•lishunsheng•2h ago•1 comments

Ask HN: Terafab – Smart move or insane financial risk?

3•imheretolearn•8h ago•1 comments

Ask HN: Who got hired with Who wants to be hired? (On 2026)

13•Gooblebrai•22h ago•8 comments

Ask HN: Is anyone seriously considering a career change?

26•zeven7•13h ago•19 comments

Time to add option in Hacker News "AI excluded Show HN"

16•tukunjil•11h ago•5 comments

Ask HN: How to start up as an individual developer?

5•alexyan0431•17h ago•3 comments

Reviewing in the Age of AI

7•maxalbarello•13h ago•5 comments

Rumors of my death are slightly exaggerated

29•CliffStoll•14h ago•4 comments

Ask HN: Is the future everyone having 100 MCP processes running on their PC?

6•ex-aws-dude•23h ago•4 comments

Ask HN: The death of software development as a job?

15•piratesAndSons•1d ago•26 comments

Ask HN: Is there a term for feeling sad about forced AI adoption?

21•ge96•1d ago•32 comments

Ask HN: Is the Job Market Actually Bad?

126•idontwantthis•4d ago•201 comments

Ask HN: Best Embedding Models?

17•devstein•2d ago•18 comments

Ask HN: How do I get freelance developer jobs?

5•downbad_•12h ago•4 comments

Ask HN: Why would we care about "extended time horizons" and LLMs?

5•ozozozd•2d ago•5 comments

Ask HN: Where are you getting your AI news from?

10•baetylus•2d ago•15 comments

Ask HN: Those building Swift apps without touching Xcode, what is your workflow?

18•p5v•3d ago•9 comments

Ask HN: Is a hands-off, family-friendly, de-Googled "home lab" feasible?

10•strix_varius•2d ago•11 comments

Why Does a Single Firefox Tab Take Almost 1.5GB RAM?

6•syeare•2d ago•9 comments

Ask HN: When did you move from AI agentic loops to simpler deterministic system?

7•laxmena•2d ago•5 comments

Tell HN: The saddest irony of my/our craft

21•dakiol•2d ago•16 comments

Can anyone point me to a ShowHN first post policy documentation?

11•upwindchange•2d ago•12 comments

You've reached the end!