frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•7mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•7mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•7mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: How do you use 5–10 minute gaps productively?

23•pea•11h ago•31 comments

LinkedIn Prevents You from Deplatforming

38•jeffkumar•11h ago•39 comments

Ask HN: Who wants to be hired? (January 2026)

157•whoishiring•2d ago•320 comments

Ask HN: How is your work making the world a better place?

9•AbstractH24•6h ago•4 comments

Ask HN: Who is hiring? (January 2026)

345•whoishiring•2d ago•241 comments

What do people usually do with spare Android phones? Any practical use cases?

14•AndroidShare•22h ago•15 comments

Ask HN: Reading list for being a better engineer?

36•drekipus•1d ago•15 comments

Ask HN: What's the future of software testing and QA?

21•sjgeek•1d ago•13 comments

Svger CLI – Zero-dependency SVG to component tool, 52% faster than SVGR

3•navid_rezadoost•10h ago•1 comments

Tell HN: Happy New Year

442•schappim•4d ago•207 comments

Ask HN: What did you learn in 2025?

17•kiernanmcgowan•1d ago•5 comments

Ask HN: Why not ban first-person pronouns from conversational AI?

6•libertyit•14h ago•6 comments

Ask HN: What if a language's structure determined memory lifetime?

4•stevendgarcia•19h ago•15 comments

Tell HN: I'm having the worst career winter of my life

93•mariogintili•2d ago•119 comments

Ask HN: Expository/Succinct Books on Modern Physics

26•rramadass•2d ago•25 comments

Ask HN: Who is using Nebula (mesh VPN)?

5•cdsl•1d ago•5 comments

Ask HN: Replacement for MacUpdater which reached EOL on 2026-01-01

2•croemer•1d ago•3 comments

How to use AI to augment learning without losing critical thinking skills?

23•mintsuku•3d ago•13 comments

It's 2026 now. Is Webpack 6.x going to happen?

3•narukeu•1d ago•7 comments

Android Tablet as Mac Display

7•jefferyabbott•1d ago•2 comments

Ask HN: When do we expose "Humans as Tools" so LLM agents can call us on demand?

47•vedmakk•3d ago•32 comments

Tell HN: Instagram Web has been broken for weeks

6•thrdbndndn•1d ago•3 comments

Books Should Update as Software

6•fullstackragab•1d ago•8 comments

Ask HN: What is your prediction for the price of computer parts in 2026?

2•wand3r•1d ago•2 comments

Ask HN: What do you think of reality check based behaviour corrector app?

2•tbhaxor•1d ago•0 comments

Ask HN: Help with LLVM

4•kvthweatt•2d ago•0 comments

Tell HN: Perplexity Has Unspecified Character Limits for Session Export

3•eth0up•16h ago•1 comments

Ask HN: Where else do you keep up-to-date?

7•throwaway132448•1d ago•1 comments

I optimised my vibe coding tech stack cost to $0

11•udit_50•2d ago•14 comments

Ask HN: How did you learn to code?

35•chistev•4d ago•81 comments