Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•1y ago
Try something like this

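  # Notebook-style setup: the Python packages plus the Poppler binaries pdf2image needs
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr   # uncomment if the Tesseract binary isn't installed yet
  from pdf2image import convert_from_path
  import pytesseract

  # Render each PDF page to a PIL image at 300 dpi
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR every page and label the output with its page number
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)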
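
Since some PDFs are image-only and others already carry a text layer, it may be worth checking which pages actually need OCR before rasterizing everything. Here is a rough sketch of that idea, not a tested pipeline. It assumes pypdf is installed (it isn't mentioned anywhere in this thread), the helper names are made up for illustration, and the 20-character threshold is an arbitrary guess you would want to tune.

```python
from typing import List


def has_text_layer(extracted: str, min_chars: int = 20) -> bool:
    """Heuristic: a page counts as 'digital' if it yields enough non-whitespace text.

    min_chars is an arbitrary, hypothetical threshold.
    """
    return len("".join(extracted.split())) >= min_chars


def pages_needing_ocr(page_texts: List[str], min_chars: int = 20) -> List[int]:
    """Return 1-based page numbers whose embedded text is too short to trust."""
    return [
        page_num
        for page_num, text in enumerate(page_texts, start=1)
        if not has_text_layer(text, min_chars)
    ]


def extract_embedded_text(pdf_path: str) -> List[str]:
    """Read each page's embedded text layer with pypdf (assumed to be installed)."""
    # Imported lazily so the two helpers above can be used without pypdf.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    return [(page.extract_text() or "") for page in reader.pages]
```

Something like `pages_needing_ocr(extract_embedded_text('k.pdf'))` would then give the page numbers to hand to pytesseract, while the remaining pages can use their embedded text directly. Scanned pages that carry a poor-quality hidden text layer would slip past this check, so treat it as a first filter only.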
constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.
