Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I had hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which was in fact the actual data.

The other major problem is that some PDFs are actually made up of images, and it got very confused by those as well.

Given that this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•1y ago
Try something like this:

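  # Notebook cell (Jupyter/Colab): lines starting with "!" run shell commands.
  !pip install pytesseract pdf2image pillow
  # pdf2image needs Poppler to render PDF pages as images.
  !apt install poppler-utils
  # pytesseract needs the tesseract binary; uncomment if it is not installed.
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render every page of the PDF to an image at 300 dpi.
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label its text with the page number.
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)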
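
For PDFs that mix real text pages with scanned image pages (the case the question mentions), one option is to OCR only the pages that lack a usable text layer. This is a rough sketch, not a tested pipeline: `has_usable_text`, `extract_pages`, and the `min_chars` threshold are hypothetical names and values made up for illustration, and the OCR step is passed in as a function so the logic stays independent of any particular library.

```python
from typing import Callable, Iterable, List, Tuple


def has_usable_text(text: str, min_chars: int = 20) -> bool:
    """Return True if the text has at least min_chars non-whitespace characters."""
    # The threshold is an arbitrary assumption; tune it for your documents.
    return len("".join(text.split())) >= min_chars


def extract_pages(
    text_layers: Iterable[str],
    ocr_page: Callable[[int], str],
    min_chars: int = 20,
) -> List[Tuple[int, str, str]]:
    """Return (page_num, source, text) per page, calling OCR only where needed."""
    results = []
    for page_num, layer in enumerate(text_layers, start=1):
        if has_usable_text(layer, min_chars):
            # The embedded text layer looks fine, so skip OCR for this page.
            results.append((page_num, "text-layer", layer))
        else:
            # Little or no embedded text: probably a scanned page, so OCR it.
            results.append((page_num, "ocr", ocr_page(page_num)))
    return results
```

Here `text_layers` could come from a PDF library's per-page text extraction (for example pypdf's `page.extract_text()`), and `ocr_page` could wrap `convert_from_path('k.pdf', dpi=300, first_page=n, last_page=n)` followed by `pytesseract.image_to_string`, so that only the scanned pages get rendered and OCR'd.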
constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.