
Ask HN: Best on-device LLM tooling for PDFs?

4•martinald•9mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which was in fact the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•9mo ago
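Try something like this:

  # The "!" lines are notebook shell commands (e.g. Colab or Jupyter)
  # Python packages, plus the poppler binaries that pdf2image relies on
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  # Uncomment if the tesseract binary is not already installed
  #!apt install tesseract-ocr

  from pdf2image import convert_from_path
  import pytesseract

  # Render every PDF page to an image; 300 dpi is a common choice for OCR
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR page by page, tagging each chunk with its page number
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)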
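
Since the question mentions that some PDFs are really just images, one possible refinement of the loop above is to OCR only the pages that have no usable text layer. This is a rough sketch, not a tested pipeline: `pypdf` is an extra dependency not mentioned in the thread, and the names `MIN_CHARS`, `needs_ocr`, `extract_pages` and `extract_pdf` are made up for illustration. The 20-character threshold is an arbitrary guess you would likely need to tune.

```python
from typing import Callable, Iterable, List

# Assumed threshold: fewer stripped characters than this and the page is
# treated as image-only. Purely illustrative; tune for your documents.
MIN_CHARS = 20


def needs_ocr(embedded_text: str, min_chars: int = MIN_CHARS) -> bool:
    """Return True when a page's text layer is empty or too short to trust."""
    return len(embedded_text.strip()) < min_chars


def extract_pages(
    embedded_texts: Iterable[str],
    ocr_page: Callable[[int], str],
    min_chars: int = MIN_CHARS,
) -> List[str]:
    """Keep embedded text where it exists; call ocr_page(n) for other pages."""
    results = []
    for page_num, text in enumerate(embedded_texts, start=1):
        if needs_ocr(text, min_chars):
            text = ocr_page(page_num)
        results.append(text)
    return results


def extract_pdf(path: str) -> List[str]:
    """Hypothetical glue: pypdf for the text layer, Tesseract as fallback."""
    # Imported here so the helpers above stay usable without these packages.
    from pdf2image import convert_from_path
    from pypdf import PdfReader
    import pytesseract

    reader = PdfReader(path)
    embedded = [page.extract_text() or "" for page in reader.pages]

    def ocr_page(page_num: int) -> str:
        # Render just the one page that needs OCR (page numbers are 1-based).
        images = convert_from_path(
            path, dpi=300, first_page=page_num, last_page=page_num
        )
        return pytesseract.image_to_string(images[0])

    return extract_pages(embedded, ocr_page)
```

Calling `extract_pdf('k.pdf')` would then give one string per page, which can be joined and passed to the local model as plain text instead of relying on its vision support. Pages with a real text layer skip OCR entirely, which is faster and avoids OCR errors on text that was already machine-readable.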
constantinum•9mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.