frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•9mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•9mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•9mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: What Are You Working On? (March 2026)

263•david927•23h ago•970 comments

Ask HN: Can I repurpose a Bluetooth voice remote as input device for a PC?

12•albert_e•1d ago•18 comments

Ask HN: Is GitHub getting less reliable, or is it just me?

5•_pdp_•1h ago•3 comments

Ask HN: Please restrict new accounts from posting

682•Oras•1d ago•481 comments

Ask HN: How to be alone?

637•sillysaurusx•1d ago•527 comments

Ask HN: Most beautiful personal blog UI you have ever seen?

121•ms7892•1d ago•47 comments

Code-review-graph: persistent code graph that cuts Claude Code token usage

2•tirthkanani•4h ago•0 comments

Ask HN: Who Needs Help?

11•surprisetalk•7h ago•5 comments

Ask HN: Which book are you reading these days?

4•chistev•2h ago•6 comments

A job ad for Agentic AI Advocate

4•greenpinia•8h ago•1 comments

Ask HN: Are showlang and thelang HN endpoints not being maintained?

3•freakynit•9h ago•0 comments

Tell HN: I'm 60 years old. Claude Code has re-ignited a passion

1058•shannoncc•2d ago•962 comments

All tmux sessions as a single terminal

2•lygten•18h ago•1 comments

OpenAI might end up on the right side of history

12•shoman3003•1d ago•10 comments

Ask HN: Are we going to see more job postings asking for only agentic coding?

5•ronbenton•22h ago•8 comments

Ask HN: How are you handling persistent memory across local Ollama sessions

5•null-phnix•1d ago•0 comments

Ask HN: How are you adapting your career in this AI era?

9•sarthaksaxena•10h ago•5 comments

Ask HN: How are you monitoring AI agents in production?

4•jairooh•1d ago•5 comments

Ask HN: Anyone else feel this community has changed recently?

53•kypro•2d ago•29 comments

Whisker – Self hosted e-commerce cart, pure PHP, zero dependencies

7•eLohith•2d ago•3 comments

Ask HN: What is your oldest living presence on the World Wide Web?

2•dhruv3006•12h ago•2 comments

I replaced my freelance SaaS stack with 5 single-file HTML tools

4•AnnSri•1d ago•2 comments

Ask HN: Why Is Phil Wang / Lucidrains Off GitHub?

3•vessenes•1d ago•4 comments

Tell HN: The proposed KIDS Act (HR 7757) effectively mandates biometric browsing

19•fokdelafons•3d ago•0 comments

PhD interrupted by personal safety issues, now publication record is thin

4•qthrwaway•2d ago•2 comments

Add llms.txt and fix robots.txt for AI agent discoverability

3•nishiohiroshi•1d ago•2 comments

Ask HN: Do You Enjoy Your Career in Tech Nowadays?

29•karakoram•4d ago•31 comments

Ask HN: Last time you wrote code?

5•blinkbat•1d ago•15 comments

How do teams prevent duplicate LLM API calls and token waste?

3•cachelogic•2d ago•1 comments

What Will Happen to Android?

4•MrLey•2d ago•3 comments