frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: Who is hiring? (April 2026)

253•whoishiring•1d ago•225 comments

Ask HN: Who wants to be hired? (April 2026)

73•whoishiring•1d ago•197 comments

Ask HN: Distributed data centers in our basements

71•cmos•2d ago•63 comments

Ask HN: What dev tools do you rely on that nobody talks about?

34•crcsmnky•1d ago•28 comments

Ask HN: Client took over development by vibe coding. What to do?

52•piscator•1d ago•35 comments

Tell HN: Chrome says "suspicious download" when trying to download yt-dlp

305•joering2•2d ago•95 comments

Ask HN: What happens when you block/mark as spam a call or text?

12•dsalzman•19h ago•2 comments

LinkedIn uses 2.4 GB RAM across two tabs

790•hrncode•4d ago•452 comments

BetterDB – open-source Redis/Valkey migration across clouds and self-hosted

13•kaliades•1d ago•4 comments

Ask HN: Academic study on AI's impact on software development – want to join?

30•research2026•1d ago•15 comments

Are tech companies even hiring?

18•KernelPryanic•1d ago•13 comments

Ask HN: How do you know if a tweak to your AI skill made it better?

6•yo103jg•1d ago•3 comments

Why did Harvey choose a top-down enterprise GTM while Cursor went bottom-up?

4•iiTsEddy•19h ago•2 comments

LinkedIn uses 65GB of RAM with 7 tabs opened

14•daniele_dll•1d ago•9 comments

Ask HN: Books on Unit Testing and TDD?

7•Desafinado•22h ago•4 comments

Ask HN: Freelancer? Seeking freelancer? (April 2026)

4•jon_north•1d ago•9 comments

Non-US founders residential address problem with Brex, Mercury?

8•Barazutti629•1d ago•1 comments

Ask HN: Does anyone else notice that gas runs out faster than usual

20•cat-turner•2d ago•31 comments

Ask HN: What was it like in the era of BBS before the internet?

31•ex-aws-dude•2d ago•35 comments

Ask HN: Who needs contributors? (March 2026)

27•Kathan2651•3d ago•16 comments

EPropelled Electric Propulsion Motors and Controllers for Uncrewed Vehicles

3•ePropelled•1d ago•0 comments

Are you team MCP or team CLI?

15•sharath39•2d ago•18 comments

Notably absent from X during Artemis launch: Elon

2•boringg•14h ago•0 comments

Tell HN: Zed is sunsetting text threads

7•koito17•1d ago•0 comments

We scanned 73 open-source MCP servers. Here's what source code analysis found

2•sigildev•20h ago•2 comments

Nexus – Agent-native ops platform for ecommerce MCP, self-registration, no demos

3•karimsherif•1d ago•0 comments

Ask HN: Best stack for building a tiny game with an 11-year-old?

17•richardstahl•4d ago•28 comments

Ask HN: Is there any founder building non AI startup in 2026?

12•daudmalik06•1d ago•7 comments

Proof-engine A mathematical rendering engine for Rust

5•Shmungus•1d ago•4 comments

The risk of AI isn't making us lazy, but making "lazy" look productive

75•acmerfight•5d ago•89 comments