frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Tell HN: docker pull fails in spain due to football cloudflare block

351•littlecranky67•5h ago•153 comments

Tell HN: OpenAI silently removed Study Mode from ChatGPT

144•smokel•4h ago•51 comments

Ask HN: Do you trust AI agents with API keys / private keys?

10•devendra116•11h ago•23 comments

Ask HN: Anyone using Nostr as a lightweight back end/DB for rapid prototyping?

4•wasimsk•12h ago•0 comments

Ask HN: Any interesting niche hobbies?

478•e-topy•6d ago•726 comments

Ask HN: What should I do with my app? 130 downloads 3 real subscribers

3•oyaa52•12h ago•6 comments

Is the pitch deck culture making founders worse at building businesses?

15•chinhqtran•1d ago•6 comments

Ask HN: What are you building that's not AI related?

144•meander_water•3d ago•194 comments

Ask HN: Former grok-code-fast-1 users, what coding model are you using now?

2•whycombinetor•15h ago•3 comments

Ask HN: Hiring in the age of AI-assisted coding: what works?

26•nitramm•1d ago•16 comments

Ask HN: Best books on building a programming language

16•ezzato•1d ago•8 comments

Ask HN: Agentic Permutation of Testing Paths In A System

4•davidajackson•21h ago•0 comments

Ask HN: Has anyone reconsidered Antivirus software after recent security news?

6•pants2•20h ago•3 comments

Ask HN: How do you handle marketing as a solo technical founder?

146•lazarkap•5d ago•107 comments

Tor Browser on Android leaks IP in desktop mode

13•shchess•1d ago•2 comments

Any Open Source projects in need of documentation writer?

21•tree666•2d ago•13 comments

Ask HN: Why Databases Instead of Filesystem?

12•uticus•2d ago•20 comments

Do founders' political views affect how you see a product?

4•rishikeshs•1d ago•3 comments

I collected startup ideas. It changed how I think about ideas completely

9•vibecoder21•1d ago•11 comments

Zooming UIs in 2026: Prezi, impress.js, and why I built something different

101•tinchox6•5d ago•44 comments

Is VC the new PMF strategy?

3•networkOne•1d ago•5 comments

Ask HN: Should AI credits be refunded on mistakes?

19•ed_elliott_asc•4d ago•20 comments

Open Source card game cuttle.cards has its world championship Saturday at 1pm ET

4•aleph_one•1d ago•0 comments

Hybrid Attention

40•JohannaAlmeida•5d ago•9 comments

Ask HN: How do you manage your digital legacy for after you die?

15•orbanlevi•4d ago•15 comments

Ask HN: Local-first meetings recorder and transcriber?

7•dandaka•3d ago•1 comments

You've reached the end!