frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Any Open Source projects in need of documentation writer?

12•tree666•2h ago•7 comments

Ask HN: What are you building that's not AI related?

116•meander_water•17h ago•140 comments

Ask HN: What would you do with an AI model capable of continuous learning?

4•jballanc•2h ago•3 comments

Ask HN: Any interesting niche hobbies?

448•e-topy•4d ago•679 comments

IMDB created my account for merely visiting the site

8•astr0n0m3r•4h ago•3 comments

Ask HN: How do you handle marketing as a solo technical founder?

132•lazarkap•3d ago•99 comments

Ask HN: Should AI credits be refunded on mistakes?

16•ed_elliott_asc•1d ago•16 comments

Ask HN: Local-first meetings recorder and transcriber?

3•dandaka•15h ago•1 comments

Ask HN: How do you manage your digital legacy for after you die?

11•orbanlevi•1d ago•13 comments

Zooming UIs in 2026: Prezi, impress.js, and why I built something different

99•tinchox6•3d ago•44 comments

Hybrid Attention

40•JohannaAlmeida•2d ago•9 comments

Ask HN: Founders/investors, what AI bet you made in 2022 and how it is going?

6•kjok•1d ago•1 comments

Engineering departments from China laid off at Red Hat

13•NBS060•5h ago•1 comments

Early precursor signals observed before incidents (RTT/DNS/HTTP telemetry)

2•ravensystems•1d ago•0 comments

Anthropic banned Pi, a third party harness

3•rapiz•1d ago•6 comments

Ask HN: S3(AWS) vs R2(CF)–Which is better?

4•wasimsk•23h ago•4 comments

GPT 5.4 in practice – Stinks?

9•sjt-at-rev•2d ago•13 comments

Claude Code limits are starting to feel like a psychological trick

7•trinsic2•3d ago•13 comments

Ask HN: Alternatives to Claude (Code)?

13•vixalien•3d ago•13 comments

Compact multi-port network setup (2.5G / 10G / SFP+) – looking for feedback

3•Qotom•2d ago•2 comments

Ask HN: Where are all the disruptive software that AI promised?

23•p-o•4d ago•21 comments

Ask HN: How are you orchestrating multi-agent AI workflows in production?

8•swrly•3d ago•10 comments

Free models you can use with your OpenClaw (no credit card needed)

3•stosssik•2d ago•1 comments

Upwork Inc. violates its own DMARC and SPF policy

8•tmcdos•3d ago•7 comments

You've reached the end!