frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•11mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•11mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•11mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: Are cloud coding agents useful in real workflows yet?

3•Rperry2174•3h ago•2 comments

Recommended GPU Repairshop in Europe (Germany)

36•DogRunner•3d ago•13 comments

How do you overcome the "build it and they will come" trap?

2•entreel•4h ago•2 comments

Ask HN: How to solve the cold start problem for a two-sided marketplace?

143•alegd•2d ago•160 comments

Ask HN: How did you land your first projects as a solo engineer/consultant?

295•modelcroissant•3d ago•144 comments

OpenClaw stats don't add up

9•iliaov•10h ago•5 comments

Ask HN: What Would Make Stack Overflow Great Again?

9•nnurmanov•12h ago•10 comments

Ask HN: What skills are future proof in an AI driven job market?

28•sunny678•1d ago•59 comments

Tell HN: My open-source project hit 5k registered users

12•darkhorse13•18h ago•4 comments

Anthropic bans orgs without warning

27•alpinisme•20h ago•13 comments

Ask HN: Are there any engineering orgs that use incentives?

2•jppope•16h ago•5 comments

GPT 5.5 Released in Codex

9•zuzululu•11h ago•2 comments

Ask Anthropic: Requesting clarity on Claude -p situation

5•andai•20h ago•0 comments

Ask HN: Building a solo business is impossible?

72•fnoef•5d ago•95 comments

My file access workaround for cron in Tahoe

4•noduerme•1d ago•0 comments

Opus 4.7 vs. 4.6 after 3 days of real coding side by side from my actual session

14•agentseal•2d ago•6 comments

Tell HN: I'm sick of AI everything

288•jonthepirate•14h ago•165 comments

Getting back into photography, ditching the phone camera in 2026?

7•eigenhombre•1d ago•6 comments

Claude Code is no longer a part of the Pro plan

16•csoham•16h ago•3 comments

Ask HN: Is GitHub's #1 repo fake? build-your-own-x 491K stars

10•throwawaysoso•1d ago•2 comments

Warning: If you search "Claude Desktop" on Google, the 2nd ad result is a scam

7•diogenescynic•2d ago•3 comments

Ask HN: What makes a good Product Manager

9•chairhairair•3d ago•10 comments

Ask HN: May be a basic question, but how can I use AI well?

9•jdw64•3d ago•4 comments

Ask HN: How did you get your first users with zero audience?

19•arikusi•5d ago•20 comments

Why don't we just ask AI to write assembler?

14•canterburry•4d ago•16 comments

Ask HN: Anyone know of that "levels of AI programming" blog post?

9•tuvix•3d ago•7 comments

You've reached the end!