Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I had hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•1y ago
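Try something like this:

  # Notebook (Colab/Jupyter) setup; the leading "!" runs a shell command.
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils   # pdf2image needs Poppler to render pages
  #!apt install tesseract-ocr  # uncomment if the Tesseract binary is missing
  from pdf2image import convert_from_path
  import pytesseract

  # Render each PDF page to an image ('k.pdf' is a placeholder path).
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page image and label the output with its page number.
  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)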
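
For PDFs that mix a real text layer with scanned pages (the image-only problem described in the question), one possible refinement is to run OCR only on pages whose text layer is empty. This is a rough sketch, not a tested pipeline: pypdf is an extra dependency that is not part of the snippet above, and `needs_ocr`, `merge_pages`, `extract_pdf`, and the `min_chars=20` threshold are hypothetical names and values chosen for illustration.

```python
def needs_ocr(extracted_text, min_chars=20):
    """Heuristic (assumption): a page with almost no text layer is a scan."""
    return len(extracted_text.strip()) < min_chars


def merge_pages(text_layers, ocr_page, min_chars=20):
    """Join per-page text, calling ocr_page(index) only where the layer is empty."""
    parts = []
    for index, layer in enumerate(text_layers):
        text = ocr_page(index) if needs_ocr(layer, min_chars) else layer
        parts.append(f"\n--- Page {index + 1} ---\n{text}")
    return "".join(parts)


def extract_pdf(path, dpi=300):
    """Wire the helpers to pypdf (text layer) and pdf2image + pytesseract (OCR)."""
    # Imported lazily so the helpers above work without these packages installed.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    reader = PdfReader(path)
    layers = [page.extract_text() or "" for page in reader.pages]

    def ocr_page(index):
        # Render only the page that needs OCR; first_page/last_page are 1-based.
        image = convert_from_path(
            path, dpi=dpi, first_page=index + 1, last_page=index + 1
        )[0]
        return pytesseract.image_to_string(image)

    return merge_pages(layers, ocr_page)
```

Calling `print(extract_pdf('k.pdf'))` would then produce the same page-labelled output as the loop above, while skipping OCR for pages that already contain selectable text. The threshold may need tuning for documents whose scanned pages carry a small amount of embedded text, such as a header or watermark.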

constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.
