frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

Gemini models increasingly stucking in thinking loop

3•StizzurpXDD•15m ago•1 comments

PDFWix Free online PDF and document conversion Tool

2•PDFWix•25m ago•0 comments

Open source, global vs. proprietary but for US in US, which is fundable in SaaS?

3•avijeetsingh16•2h ago•0 comments

Tell HN: Happy Fathers Day

334•consumer451•1d ago•56 comments

Tell HN: I never bought anything from clicking on a paid ad

10•julienreszka•7h ago•5 comments

Ask HN: Do you have an unusual income source

42•xupybd•1d ago•29 comments

Ask HN: Anthropic banned me from using Claude Code and I don't know what to do

51•ayi•2h ago•49 comments

Ask HN: What did you find out or explore today?

5•blahaj•13h ago•5 comments

Ask HN: Will programmers write more efficient code during the memory shortage?

149•amichail•3d ago•243 comments

Ask HN: Fda.gov Down for You?

2•jmount•15h ago•2 comments

Ask HN: Has Codex gotten slower recently?

6•aurenvale•6h ago•1 comments

Ask HN: What is your opinion on TUI applications

6•po1nt•22h ago•8 comments

Ask HN: Is anyone using the A2A protocol?

94•asim•5d ago•44 comments

Ask HN: How close are we to local LLMs being useful? What's the impact?

6•AbstractH24•19h ago•6 comments

Ask HN: Are you being "529 Overloaded" by Anthropic too?

8•hmokiguess•1d ago•8 comments

Ask HN: What tools are you using for AI-assisted code review?

25•agos•4d ago•26 comments

Ask HN: Are people optimistic about the future?

38•JohnDSDev•2d ago•74 comments

Norrin – Git/ diff control in Claude Code

4•gagewoodard•1d ago•1 comments

My Opinion on RL

3•umjunsik132•1d ago•1 comments

GitHub Banned All CI for Our (OSS) Org Because of Bad Drive-By Contributors

9•BlueMatt•1d ago•4 comments

Ask HN: Are You a Workaholic?

5•julienreszka•1d ago•4 comments

Ask HN: What would justify writting an OS kernel in 2026?

4•alonsovm44•1d ago•6 comments

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

8•snowhy•4d ago•6 comments

Ask HN: How should I convert Microsoft Word documents to Markdown?

5•lkrubner•1d ago•7 comments

Ask HN: After you ship a feature, what happens to what you learned?

10•gaggle_dk•2d ago•13 comments

Ask HN: What technique do you use to make Claude Code deterministic?

7•hbarka•2d ago•11 comments

Ask HN: What do you care about? What is your joy and purpose?

10•bix6•2d ago•20 comments

Ask HN: Favorite aspects of Cocoa/NeXTSTEP for app dev?

6•elcritch•2d ago•0 comments

Ask HN: What are your favourite Hacker News comments?

4•Imustaskforhelp•2d ago•4 comments

Ask HN: Any AI native Anki alternatives?

5•shadag•23h ago•3 comments