frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: what’s your favorite line in your Claude/agents.md files?

3•khasan222•42m ago•0 comments

Tell HN: H&R Block tax software installs a TLS backdoor

79•yifanlu•17h ago•6 comments

Ask HN: Is operational memory a missing layer in AI agent architecture?

2•varunrrai•9h ago•0 comments

Structural Friction: A metric for human coordination cost

4•davidvartanian•21h ago•0 comments

Spotify playing ads for paid subscribers

143•IncandescentGas•2d ago•124 comments

Ask HN: What are the greatest discoveries in the last few years?

11•chistev•13h ago•3 comments

AI World

2•Shashankms•16h ago•0 comments

Ask HN: What features does a chip require to beat Nvidia in the AI race?

3•caloricflow•16h ago•9 comments

Ask HN: Are AI mandates a good idea?

4•Solar_Flare•7h ago•14 comments

Anchor: Hardware-based authentication using SanDisk USB devices

3•rewant•2d ago•0 comments

Ask HN: What do you look for in your first 10 hires?

28•neilk17•2d ago•31 comments

Ask HN: Are you using a Pivotal Tracker successor? Which one?

2•antfarm•22h ago•2 comments

Ask HN: How do you deal with people who trust LLMs?

149•basilikum•2d ago•198 comments

Ask HN: The new wave of AI agent sandboxes?

10•ATechGuy•1d ago•4 comments

European municipalities leak citizen data to US companies

10•sam_lowry_•1d ago•2 comments

Ask HN: How to Find a Job in the UK

9•0x3444ac53•1d ago•6 comments

Ask HN: Is vibe coding a new mandatory job requirement?

35•newswangerd•3d ago•72 comments

TTal – CLI that turns Claude Code into a multi-agent software factory

5•neilbb•2d ago•3 comments

Ask HN: Broken "Public Research Articles" Link: Any Support for Google Scholar?

4•yann-gael•1d ago•2 comments

Ask HN: When Will AI Disappear?

4•roschdal•5h ago•7 comments

I built a game where you argue consumer rights against AI bots

7•dragonmann•1d ago•3 comments

Ask HN: Can we make up/downvotes/flags visible on Hacker News Archives (delayed)

6•gpt5•1d ago•1 comments

How are Iranian drones getting their guidance?

6•dottenad•1d ago•4 comments

Ask HN: Have you cancelled any software subscriptions because AI replaced them?

9•maxim_bg•2d ago•12 comments

Tell HN: If working with agents means this, robots, please take my job

12•tiredagent•1d ago•11 comments

Ask HN: Who is still using Windsurf and why?

9•mak8•2d ago•9 comments

Ask HN: Are we ready for vulnerabilities to be words instead of code?

4•lielcohen•1d ago•11 comments

Open AI is actively censoring information about voting today in the US

13•resters•3d ago•16 comments

Ask HN: Is Claude down Again?

11•rreyes1979•2d ago•6 comments

Claude Code 500s

18•bavarianbob•3d ago•5 comments