frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•9mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•9mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•9mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Aura-State: Formally Verified LLM State Machine Compiler

9•rohanmunshi08•3d ago•2 comments

We don't need continual learning for AGI. What top labs are currently doing

3•kok14•1h ago•0 comments

An offline map using OruxMaps(satellite,routing,3D terrain,GPS and POI)

2•supergoogler•2h ago•0 comments

Ask HN: Who wants to be hired? (March 2026)

124•whoishiring•2d ago•365 comments

Ask HN: Has anyone noticed the fear-driven prompt suggestions that GPT5.3 makes?

8•cedarscarlett•9h ago•6 comments

Ask HN: Who is hiring? (March 2026)

243•whoishiring•2d ago•357 comments

Tell HN: Digital Ocean has run out of GPU droplets

13•nathannaveen•18h ago•2 comments

Stathat Is Shutting Down

6•jervant•11h ago•2 comments

Ask HN: Maintainers, do LLM-only users often clutter your issues/PRs?

6•lucrbvi•17h ago•7 comments

Ask HN: Anyone have experience making physical toys that you've sold?

2•general_reveal•12h ago•1 comments

Ask HN: How are you all staying sane?

146•throwaway53463•3d ago•149 comments

I lost my ability to learn anything new because of AI and I need your opinions

18•dokdev•1d ago•27 comments

Ask HN: If your project is free, what are you building and why keep it free?

5•LeanVibe•18h ago•16 comments

Ask HN: What's your experience self-hosting in 2026?

25•rustcore•1d ago•9 comments

Tell HN: I got Claude Max for my open source project

9•asim•1d ago•5 comments

Ask HN: What sources like HN do you consume?

54•DavidHaerer•3d ago•35 comments

Ask HN: Why has ChatGPT disabled links to websites?

6•krschacht•20h ago•4 comments

Ask HN: What will OpenAI employees do now who have signed notdividedorg petition

17•Imustaskforhelp•1d ago•16 comments

Ask HN: What Online LLM / Chat do you use?

12•ddxv•2d ago•18 comments

Ask HN: How many of you hold an amateur radio license in your country?

3•ToddWBurgess•1d ago•3 comments

Ask HN: What prompt do you use to get Claude to consistently render LaTeX?

5•TobyTheCamel•1d ago•6 comments

Whats Up with Claude Lately?

19•mech422•2d ago•14 comments

Ask HN: Would engineers be interested in a technical prep consultant?

8•TechPrepper•2d ago•9 comments

Ask HN: How is Claude agent experience in Xcode 26.3?

8•malshe•1d ago•2 comments

Ask HN: What is the "Control Plane" for local AI agents?

7•denis4inet•1d ago•8 comments

Ask HN: How do you find contracting/freelance roles without recruiters nowadays?

3•Gooblebrai•14h ago•2 comments

Ask HN: Who still works async and has a 'no meetings' work policy in 2026?

7•RaulOnRails•1d ago•5 comments

Facebook Appears to Be Down

13•Molitor5901•1d ago•16 comments

Ask HN: How Do Emergency Alerts on Phone Work?

5•rishikeshs•2d ago•4 comments

Tell HN: GitHub Having Issues

52•Sytten•1d ago•27 comments