frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

AI coding is a nightmare. Am I the only one experiencing this?

47•sollawen•8h ago•29 comments

Ask HN: Since when does Craigslist's front page have emojis?

36•argee•2d ago•33 comments

Ask HN: Who is hiring? (July 2026)

238•whoishiring•1d ago•282 comments

Ask HN: Why are so many "AI evangelists" posting such insufferable content?

45•seattle_spring•17h ago•29 comments

Ask HN: Who wants to be hired? (July 2026)

145•whoishiring•1d ago•405 comments

AskHN: Using 'claude -p' for running Mr.Jassy - AWS butler agent

2•anoop_kumar•7h ago•0 comments

Reducing AI costs with smart pricing

2•millereffect•8h ago•0 comments

Tell HN: Old Reddit now requires login

81•jay_kyburz•1d ago•17 comments

Ask HN: Once you make your money from vibe coding innumerable products, then?

2•keepamovin•9h ago•8 comments

Ask HN: How do you get your open-source product good traction?

3•akarshhegde18•9h ago•3 comments

Burned out, how do I get out of the rut?

6•brandgefahr•14h ago•15 comments

Tell HN: Installing Cursor on iOS irreversibly changes your privacy settings

246•zkldi•2d ago•34 comments

Ask HN: Best Local LLM Setup for a 128GB M4 Max Mac Studio?

6•linzhangrun•9h ago•0 comments

Rust sort_unstable_by with more complex closure unexpectedly shrunk binary

5•tracyspacy•21h ago•2 comments

Seattle Just Had an Earthquake

11•tobinfekkes•1d ago•7 comments

I'm opening VSCode less and less every day

19•othmanosx•1d ago•18 comments

Stopping token burn because of agents sticking in a loop

2•driftguard•16h ago•0 comments

Ask HN: Will AI force CS to focus on what to build instead of how to build it?

2•amichail•17h ago•2 comments

Should I go open-source or give free trial and then charge the users?

3•akarshhegde18•17h ago•4 comments

Ask HN: Secure wrapper for coding agents?

20•rjzzleep•2d ago•19 comments

Are Claude models broken with the Fable 5 update?

7•jackjd•1d ago•2 comments

Fire and Coast Fire Tracker · Streamlit

3•Rossklot•8h ago•0 comments

Ask HN: Where is the programming profession going?

165•syntaxbush•1w ago•183 comments

Ask HN: Why aren't companies hoarding AI talent?

5•playorizaya•22h ago•26 comments

Ask HN: What things might help me to become inference engineer?

4•chalshik•1d ago•3 comments

Ask HN: Is anyone experimenting with different ways of using LLMs for coding?

33•yehiaabdelm•5h ago•55 comments

Ask HN: Line by Line Agentic Coding

6•notshore•1d ago•5 comments

Tell HN: I'm not excited for Fable and am disappointed in Karpathy

6•behnamoh•1d ago•10 comments

Ask HN: What does a good day at work look like in the AI era?

2•akashwadhwani35•5h ago•1 comments

Ask HN: What do you use computer mode for?

4•aryamaan•1d ago•0 comments