frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: Broken "Public Research Articles" Link: Any Support for Google Scholar?

2•yann-gael•2h ago•0 comments

How are Iranian drones getting their guidance?

5•dottenad•2h ago•2 comments

New Open Source Release

6•BrainDAnderson•3h ago•2 comments

Ask HN: The new wave of AI agent sandboxes?

8•ATechGuy•6h ago•2 comments

Tell HN: If working with agents means this, robots, please take my job

9•tiredagent•5h ago•9 comments

Ask HN: Are we ready for vulnerabilities to be words instead of code?

2•lielcohen•5h ago•8 comments

Spotify playing ads for paid subscribers

136•IncandescentGas•1d ago•120 comments

European municipalities leak citizen data to US companies

7•sam_lowry_•7h ago•0 comments

Ask HN: How to Find a Job in the UK

6•0x3444ac53•7h ago•1 comments

Ask HN: What do you look for in your first 10 hires?

24•neilk17•19h ago•29 comments

Ask HN: How do you deal with people who trust LLMs?

145•basilikum•1d ago•189 comments

TTal – CLI that turns Claude Code into a multi-agent software factory

4•neilbb•21h ago•1 comments

Ask HN: What is it like being in a CS major program these days?

210•tathagatadg•3d ago•201 comments

Ask HN: Who is still using Windsurf and why?

9•mak8•19h ago•5 comments

I built a game where you argue consumer rights against AI bots

5•dragonmann•13h ago•1 comments

Ask HN: Have you cancelled any software subscriptions because AI replaced them?

7•maxim_bg•17h ago•9 comments

Ask HN: Is vibe coding a new mandatory job requirement?

33•newswangerd•2d ago•71 comments

Tell HN: AI tools are making me lose interest in CS fundamentals

98•Tim25659•4d ago•92 comments

Ask HN: Is Claude down Again?

10•rreyes1979•1d ago•6 comments

Ask HN: Why is everyone on HN obsessed with Rust?

14•goldkey•1d ago•10 comments

Ask HN: How are you protecting yourself from skill atrophy?

5•xpnsec•1d ago•11 comments

Open AI is actively censoring information about voting today in the US

11•resters•2d ago•16 comments

Ask HN: Which router makers do you trust most?

9•general_reveal•1d ago•12 comments

Claude Code 500s

16•bavarianbob•2d ago•5 comments

Skills Manager – manage AI agent skills across Claude, Cursor, Copilot

3•evergreenxx•1d ago•8 comments

WFH is becoming a benefit again

61•sharemywin•13h ago•71 comments

Knowledge workers managing AI show collapsed productivity, not just a plateau

6•dfordp11•1d ago•2 comments

Ask HN: Did GitHub remove Opus and Sonnet from their Copilot Pro subscription?

14•lgl•3d ago•7 comments

Ask HN: How to Learn C++ in 2026?

15•creatorcoder•4d ago•14 comments

Ask HN: We need to learn algorithm when there are Claude Code etc.

11•JasonHEIN•2d ago•2 comments