Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•10mo ago
Try something like this:

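  # Notebook (Colab/Jupyter) cell: lines starting with "!" run shell commands
  !pip install pytesseract pdf2image pillow
  # pdf2image needs poppler to rasterise PDF pages
  !apt install poppler-utils
  # pytesseract needs the tesseract binary; uncomment if it is not installed
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render each PDF page to an image at 300 DPI
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR every page and label it with its page number
  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)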
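
For PDFs that mix real text with scanned pages, one possible variation is to read the embedded text layer first and OCR only the pages that lack one. This is a rough sketch, not a tested pipeline: it assumes pypdf is installed alongside the packages above, and the helper names (needs_ocr, format_pages, extract_pdf_text) and the 20-character threshold are made up for illustration.

```python
def needs_ocr(text_layer: str, min_chars: int = 20) -> bool:
    """Guess that a page is image-only when its text layer is (nearly) empty.

    The 20-character default is an arbitrary assumption; tune it for your files.
    """
    return len(text_layer.strip()) < min_chars


def format_pages(page_texts):
    """Join per-page text using the same '--- Page N ---' separators as above."""
    return "".join(
        f"\n--- Page {n} ---\n{text}"
        for n, text in enumerate(page_texts, start=1)
    )


def extract_pdf_text(path: str, dpi: int = 300) -> str:
    """Use the embedded text where present, fall back to OCR page by page."""
    # Third-party imports are local so the helpers above work without them.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    page_texts = []
    reader = PdfReader(path)
    for index, page in enumerate(reader.pages, start=1):
        text = page.extract_text() or ""
        if needs_ocr(text):
            # Rasterise only this page, then OCR it.
            image = convert_from_path(
                path, dpi=dpi, first_page=index, last_page=index
            )[0]
            text = pytesseract.image_to_string(image)
        page_texts.append(text)
    return format_pages(page_texts)
```

Calling print(extract_pdf_text('k.pdf')) would then give output in the same shape as the snippet above, while skipping OCR on pages that already carry usable text.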
constantinum•10mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.
