frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•9mo ago
give https://pg.llmwhisperer.unstract.com/ a try

AI tools are making me lose interest in CS fundamentals

46•Tim25659•2h ago•37 comments

Ask HN: How is AI-assisted coding going for you professionally?

270•svara•12h ago•463 comments

Ask HN: How is your team collaborating while working with coding agents?

2•wek•3h ago•1 comments

Ask HN: How to Learn C++ in 2026?

8•creatorcoder•6h ago•8 comments

I'm 60 years old. Claude Code killed a passion

191•fred1268•15h ago•156 comments

Tons of new LLM bot accounts here

12•koolala•9h ago•13 comments

Ask HN: Have you successfully treated forward head posture ("nerd neck")?

48•trashymctrash•1d ago•39 comments

Ask HN: What breaks first when your team grows from 10 to 50 people?

8•hariprasadr•17h ago•6 comments

Built a 1.3M-line agent-native OS in Rust while homeless. What now?

9•jamieoglindsey•12h ago•10 comments

I traced $2B in nonprofit grants for Meta and Age Verification lobbying

91•theseusares•2d ago•20 comments

Ask HN: What was it like for programmers when spreadsheets became ubiquitous?

11•yodaiken•1d ago•11 comments

Tell HN: iPhone 6s still getting security updates

6•uticus•1d ago•2 comments

Ask HN: Why is there a lack of useful use cases for OpenClaw?

3•nazbasho•8h ago•5 comments

Ask HN: Med student interested in bci startups..where do I start?

2•Sxouterred•21h ago•3 comments

Ask HN: Got cancer, a new job,new boss in less than a year What do I do now?

19•Goleniewski•2d ago•18 comments

Ask HN: Why can't we just make more RAM?

25•chatmasta•2d ago•21 comments

Ask HN: What will you be doing for the next 10 years?

8•smarri•10h ago•9 comments

How not to fork an open source project

5•jsattler•1d ago•1 comments

MiniMax M2.5 is trained by Claude Opus 4.6?

10•Orellius•2d ago•10 comments

Ask HN: How do you use Coding Agents/CLIs out of coding?

4•arbayi•1d ago•6 comments

X is selling existing users' handles

197•hac•4d ago•91 comments

Ask HN: What's your biggest pain point when joining a new developer team?

8•KevStatic•2d ago•15 comments

Toolpack SDK, an Open Source TypeScript SDK for Building AI-Powered Applications

2•sajeerzeji•1d ago•1 comments

Prompt to make Claude more autonomous in web dev

5•louison11•1d ago•1 comments

Ask HN: Why have co-ops never played a major role in tech?

14•AbstractH24•2d ago•8 comments

Claude broke a ZIP password in a smart way

7•jgrahamc•1d ago•2 comments

Ask HN: 100k/year individual token usage?

7•alecsmart1•1d ago•4 comments

Ask HN: Is Claude down again?

86•coderbants•4d ago•73 comments

Ask HN: Would this eliminate bots for good?

3•piratesAndSons•1d ago•12 comments

Generate tests from GitHub pull requests

7•Aamir21•2d ago•7 comments