frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: What home printer do you use/recommend?

10•niyazpk•46m ago•6 comments

Ask HN: Are people generally interested using LLMs for learning purposes?

4•iknownthing•3h ago•8 comments

Ask HN: Anthropic banned me from using Claude Code and I don't know what to do

66•ayi•15h ago•80 comments

Ask HN: Am I missing something with AI

4•vasko•8h ago•8 comments

Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB

5•spidy__•9h ago•2 comments

Ask HN: Do you have an unusual income source

45•xupybd•1d ago•33 comments

Ask HN: New clean macOS install. Must-have apps? Best browser?

14•simonebrunozzi•10h ago•17 comments

Tell HN: I never bought anything from clicking on a paid ad

17•julienreszka•20h ago•16 comments

Ask HN: Will programmers write more efficient code during the memory shortage?

152•amichail•3d ago•243 comments

Open source, global vs. proprietary but for US in US, which is fundable in SaaS?

4•avijeetsingh16•16h ago•0 comments

Ask HN: Is anyone using the A2A protocol?

94•asim•5d ago•44 comments

Ask HN: What tools are you using for AI-assisted code review?

25•agos•5d ago•26 comments

Ask HN: What is your opinion on TUI applications

8•po1nt•1d ago•9 comments

Ask HN: What did you find out or explore today?

5•blahaj•1d ago•5 comments

Ask HN: Are people optimistic about the future?

40•JohnDSDev•3d ago•83 comments

Ask HN: Are you being "529 Overloaded" by Anthropic too?

8•hmokiguess•1d ago•8 comments

Ask HN: Fda.gov Down for You?

2•jmount•1d ago•2 comments

Ask HN: How close are we to local LLMs being useful? What's the impact?

6•AbstractH24•1d ago•6 comments

Ask HN: What would justify writting an OS kernel in 2026?

5•alonsovm44•1d ago•7 comments

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

8•snowhy•5d ago•6 comments

Norrin – Git/ diff control in Claude Code

4•gagewoodard•2d ago•1 comments

Ask HN: Are You a Workaholic?

5•julienreszka•2d ago•5 comments

Ask HN: Has Codex gotten slower recently?

6•aurenvale•19h ago•1 comments

My Opinion on RL

3•umjunsik132•1d ago•1 comments

GitHub Banned All CI for Our (OSS) Org Because of Bad Drive-By Contributors

9•BlueMatt•1d ago•4 comments

Ask HN: How should I convert Microsoft Word documents to Markdown?

5•lkrubner•2d ago•7 comments

Ask HN: After you ship a feature, what happens to what you learned?

10•gaggle_dk•2d ago•13 comments

Ask HN: What technique do you use to make Claude Code deterministic?

8•hbarka•3d ago•11 comments

Ask HN: What do you care about? What is your joy and purpose?

12•bix6•3d ago•21 comments

You've reached the end!