Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried it on PDFs locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", then gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this:

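  # Notebook cell: install the Python packages and system dependencies.
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  # pytesseract also needs the Tesseract binary; uncomment if it isn't installed.
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render each PDF page to an image at 300 DPI.
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label the output with its page number.
  page_texts = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      page_texts.append(f"\n--- Page {page_num} ---\n{text}")

  print("".join(page_texts))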
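
For PDFs that mix real text with scanned pages, one option is to read the embedded text layer first and OCR only the pages that come back empty. This is a rough sketch, not a tested pipeline: it assumes `pypdf` is installed alongside the packages above, and the `needs_ocr` helper and its 20-character threshold are made up for illustration and would need tuning.

```python
def needs_ocr(extracted_text: str, min_chars: int = 20) -> bool:
    """Heuristic (an assumption): a nearly empty text layer suggests a scanned page."""
    return len(extracted_text.strip()) < min_chars


def extract_pdf_text(path: str, dpi: int = 300) -> str:
    """Use the embedded text layer where present; OCR only pages that lack one."""
    # Imported lazily so needs_ocr() can be used without these packages installed.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    reader = PdfReader(path)
    chunks = []
    for page_num, page in enumerate(reader.pages, start=1):
        text = page.extract_text() or ""
        if needs_ocr(text):
            # Render just this one page to an image and run OCR on it.
            images = convert_from_path(
                path, dpi=dpi, first_page=page_num, last_page=page_num
            )
            text = pytesseract.image_to_string(images[0])
        chunks.append(f"\n--- Page {page_num} ---\n{text}")
    return "".join(chunks)
```

Calling `extract_pdf_text('k.pdf')` would then give plain text you can hand to a local model instead of relying on its vision support.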
constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.
