frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

Tell HN: H&R Block tax software installs a TLS backdoor

122•yifanlu•1d ago•9 comments

Are AI Agents like von Hammerstein's industrious and stupid?

12•multidude•14h ago•7 comments

Ask HN: what’s your favorite line in your Claude/agents.md files?

7•khasan222•18h ago•3 comments

Ask HN: Why isn't the NSA categorized as an APT?

4•TheOpenSourcer•14h ago•8 comments

Spotify playing ads for paid subscribers

147•IncandescentGas•3d ago•126 comments

Structural Friction: A metric for human coordination cost

5•davidvartanian•1d ago•0 comments

Anchor: Hardware-based authentication using SanDisk USB devices

4•rewant•2d ago•1 comments

Ask HN: Is operational memory a missing layer in AI agent architecture?

3•varunrrai•1d ago•0 comments

Ask HN: What do you look for in your first 10 hires?

28•neilk17•2d ago•32 comments

Ask HN: What features does a chip require to beat Nvidia in the AI race?

4•caloricflow•1d ago•12 comments

European municipalities leak citizen data to US companies

11•sam_lowry_•2d ago•4 comments

Ask HN: How do you deal with people who trust LLMs?

152•basilikum•3d ago•199 comments

Ask HN: What are the greatest discoveries in the last few years?

14•chistev•1d ago•4 comments

Ask HN: Is vibe coding a new mandatory job requirement?

36•newswangerd•4d ago•74 comments

Ask HN: Have you cancelled any software subscriptions because AI replaced them?

10•maxim_bg•2d ago•14 comments

Ask HN: The new wave of AI agent sandboxes?

11•ATechGuy•2d ago•4 comments

Ask HN: Are you using a Pivotal Tracker successor? Which one?

2•antfarm•1d ago•3 comments

Ask HN: How to Find a Job in the UK

9•0x3444ac53•2d ago•9 comments

TTal – CLI that turns Claude Code into a multi-agent software factory

5•neilbb•2d ago•3 comments

I built a game where you argue consumer rights against AI bots

7•dragonmann•2d ago•3 comments

Ask HN: Broken "Public Research Articles" Link: Any Support for Google Scholar?

4•yann-gael•2d ago•2 comments

Ask HN: Who is still using Windsurf and why?

10•mak8•2d ago•9 comments

Ask HN: Can we make up/downvotes/flags visible on Hacker News Archives (delayed)

6•gpt5•2d ago•1 comments

How are Iranian drones getting their guidance?

7•dottenad•2d ago•4 comments

Tell HN: If working with agents means this, robots, please take my job

13•tiredagent•2d ago•14 comments

Ask HN: Are we ready for vulnerabilities to be words instead of code?

5•lielcohen•2d ago•11 comments

Ask HN: Are AI mandates a good idea?

4•Solar_Flare•1d ago•15 comments

Ask HN: Is Google disabling accounts for using gws CLI with OpenClaw?

3•anandvc•2d ago•1 comments

Open AI is actively censoring information about voting today in the US

13•resters•4d ago•15 comments

Claude Code 500s

18•bavarianbob•4d ago•5 comments