frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

57•iliashad•1h ago•8 comments

FTX's former Anthropic stake would be worth about $75B at today's valuation

2•adam_rida•27m ago•2 comments

Ask HN: What are you working on? (June 2026)

11•david927•53m ago•14 comments

Ask HN: How are thinking efforts implemented?

95•simianwords•1w ago•30 comments

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

796•eries•4d ago•572 comments

Is there a name for the type of comments agents add where they leak the prompt?

15•xdennis•16h ago•8 comments

Ask HN: Favorite text heavy blogs that are a joy to read?

112•joshmarinacci•4d ago•28 comments

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

262•mdni007•5d ago•239 comments

AWS Bedrock to require sharing data with Anthropic for Mythos and future models

423•TomAnthony•4d ago•253 comments

Ask HN: Want to build something open source on nights and weekends together?

38•vira28•3d ago•15 comments

We should start measuring knowledge debt like the way we do for tech debt

5•ciwolex•3h ago•0 comments

Notes on DeepSeek

206•vinhnx•4d ago•140 comments

Ask HN: Would it be useful to have a slop button in addition to flag?

35•BugsJustFindMe•3d ago•23 comments

Story of How Im Running an Unlimited $6/Month AI Provider on 4x RTX 3090s

6•yolo-auto•11h ago•2 comments

Ask HN: Why is the best way to find a job as a Software Engineer in 2026?

6•Ako03•4h ago•8 comments

Ask HN: Are most corporate SWE jobs performative?

247•hnthrow10282910•4d ago•282 comments

Ask HN: How do you get into a flow state when using AI to code?

90•kilroy123•3d ago•116 comments

Ask HN: What are tools you have made for yourself since the advent of AI?

441•aryamaan•5d ago•769 comments

I procrastinate by building tools to stop me from procrastinating: A sad story

20•thisislorenzov•3d ago•10 comments

Ask HN: Are you still using a Vision Pro?

171•y1n0•4d ago•214 comments

Ask HN: Which cheap Chinese LLM are you using?

13•linzhangrun•14h ago•4 comments

Ask HN: Did we witness the "Trinity moment" for AI?

18•vld_chk•22h ago•26 comments

Tell HN: iOS devs, get back lots of disk space: xcrun simctl delete unavailable

4•amichail•22h ago•0 comments

Ask HN: Is anyone shorting the overspend in AI yet?

17•ggm•3d ago•15 comments

What if we legally required politicians to work regular jobs 2 months a year?

11•ekoeko•1d ago•13 comments

Ask HN: What is the AI adoption approach at your org?

5•iExploder•1d ago•7 comments

Did anyone went to YC directly from Sri Lanka?

3•geethikaisuru•1d ago•0 comments

Ask HN: Is there a metric for AI code quality?

6•fractalf•3d ago•5 comments

Ask HN: I Need Help for a Product

8•memoryleakgame•2d ago•5 comments

Ask HN: What is your blog and why should I read it?

8•chistev•9h ago•5 comments