frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Who is hiring? (April 2026)

266•whoishiring•1d ago•303 comments

Ask HN: Who wants to be hired? (April 2026)

81•whoishiring•1d ago•289 comments

Ask HN: What is your dev set up like?

11•break_the_bank•15h ago•18 comments

Ask HN: European Tech Alternatives?

94•BrunoBernardino•13h ago•51 comments

Ask HN: Distributed data centers in our basements

72•cmos•2d ago•65 comments

Stripe closed my UAE business account and is withholding $3.5K

20•alganzory•16h ago•2 comments

Ask HN: How relevant is low code according to you in today’s world with Claude?

4•kinj28•11h ago•2 comments

Tell HN: Chrome says "suspicious download" when trying to download yt-dlp

305•joering2•2d ago•95 comments

Ask HN: Client took over development by vibe coding. What to do?

58•piscator•1d ago•38 comments

LinkedIn uses 2.4 GB RAM across two tabs

792•hrncode•4d ago•452 comments

Ask HN: What dev tools do you rely on that nobody talks about?

37•crcsmnky•1d ago•30 comments

Ask HN: Should there be a temporary ban on new accounts?

10•l33tbro•10h ago•15 comments

Ask HN: Academic study on AI's impact on software development – want to join?

30•research2026•2d ago•15 comments

BetterDB – open-source Redis/Valkey migration across clouds and self-hosted

14•kaliades•1d ago•6 comments

Ask HN: What happens when you block/mark as spam a call or text?

12•dsalzman•1d ago•2 comments

Are tech companies even hiring?

19•KernelPryanic•1d ago•18 comments

Ask HN: How do you know if a tweak to your AI skill made it better?

7•yo103jg•1d ago•4 comments

LinkedIn uses 65GB of RAM with 7 tabs opened

14•daniele_dll•2d ago•9 comments

Ask HN: Who needs contributors? (March 2026)

28•Kathan2651•4d ago•16 comments

Ask HN: Does anyone else notice that gas runs out faster than usual

20•cat-turner•3d ago•31 comments

Ask HN: What was it like in the era of BBS before the internet?

31•ex-aws-dude•3d ago•35 comments

Why did Harvey choose a top-down enterprise GTM while Cursor went bottom-up?

4•iiTsEddy•1d ago•3 comments

Non-US founders residential address problem with Brex, Mercury?

8•Barazutti629•2d ago•1 comments

Ask HN: Books on Unit Testing and TDD?

7•Desafinado•1d ago•6 comments

Are you team MCP or team CLI?

15•sharath39•3d ago•20 comments

Ask HN: Freelancer? Seeking freelancer? (April 2026)

5•jon_north•1d ago•10 comments

Ask HN: Is there any founder building non AI startup in 2026?

13•daudmalik06•2d ago•9 comments

EPropelled Electric Propulsion Motors and Controllers for Uncrewed Vehicles

3•ePropelled•1d ago•0 comments

You've reached the end!

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try