frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•8mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•8mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•8mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: DDD was a great debugger – what would a modern equivalent look like?

28•manux81•11h ago•31 comments

Tell HN: I cut Claude API costs from $70/month to pennies

32•ok_orco•10h ago•14 comments

Ask HN: What software / applications can you now build thanks to AI

4•zarathustra333•5h ago•1 comments

Ask HN: Freelance Qt C++

2•shchess•4h ago•1 comments

Ask HN: Running UPDATEs in production always feels heavier than it should

3•Lucy_Bai•4h ago•3 comments

Ask HN: Gmail spam filtering suddenly marking everything as spam?

208•goopthink•1d ago•122 comments

Ask HN: What's the current best local/open speech-to-speech setup?

253•dsrtslnd23•2d ago•61 comments

Compiled Node.js 18 from source on jailbroken iPhone to run Claude Code

2•BryanTheCynic•7h ago•0 comments

Ask HN: Some great launch videos in recent times?

2•nemath•8h ago•0 comments

Ask HN: Do you have any evidence that agentic coding works?

456•terabytest•5d ago•452 comments

SHDL – A Minimal Hardware Description Language Built from Logic Gates

2•rafa_rrayes•10h ago•1 comments

Ask HN: What are the most significant man-made creations to date?

15•George97•20h ago•23 comments

Tell HN: 2 years building a kids audio app as a solo dev – lessons learned

136•oliverjanssen•4d ago•77 comments

I'm posting this from a memory safe web browser

38•pizlonator•13h ago•2 comments

Ask HN: Why is cursor / Claude Code is so bad at generating readmes?

4•yakshithk_•8h ago•3 comments

Ask HN: Why are so many rolling out their own AI/LLM agent sandboxing solution?

32•ATechGuy•5d ago•14 comments

Ask HN: Have we confused efficiency with "100% utilization"?

27•nickevante•1d ago•20 comments

Ask HN: How to reach out to a commenter under an old submission (nick_m)?

4•jsumn•20h ago•4 comments

Ask HN: What usually happens after a VC asks for a demo?

12•stijo•1d ago•6 comments

Ask HN: May an agent accept a license to produce a build?

26•athrowaway3z•1d ago•48 comments

Ask HN: Revive a mostly dead Discord server

21•movedx•5d ago•29 comments

Ask HN: Career transition question – assistance, MLOps guidance

4•Pierre_Esteves•1d ago•0 comments

Ask HN: Why does the number of datasets on data.gov vary so much?

8•akudha•1d ago•4 comments

Ask HN: Thinking about memory for AI coding agents

7•hoangnnguyen•1d ago•9 comments

Ask HN: What are some good unintuitive statistics problems?

6•ronbenton•1d ago•7 comments

Ask HN: Rust and AI builders interested in local-first, multi-agent systems?

3•cajazzer•1d ago•8 comments

Ask HN: How to redeem a gift card without risking lock-out?

6•magnetic•1d ago•6 comments

Ask HN: Weekend Social: Top two programming languages and what they can borrow?

3•susam•1d ago•7 comments

Ask HN: Which common map projections make Greenland look smaller?

19•jimnotgym•5d ago•17 comments

Ask HN: Do you "micro-manage" your agents?

7•xinbenlv•2d ago•8 comments