Ask HN: Best on device LLM tooling for PDFs?

4•martinald•9mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that some PDFs are actually made up of images, and it got very confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•9mo ago
Try something like this

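  # Notebook (Colab/Jupyter) cells: the "!" lines are shell commands.
  !pip install pytesseract pdf2image pillow
  !apt install -y poppler-utils   # pdf2image needs Poppler to rasterise pages
  #!apt install -y tesseract-ocr  # uncomment if the tesseract binary is missing
  from pdf2image import convert_from_path
  import pytesseract

  # Render each PDF page to an image; 300 dpi is a common choice for OCR.
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label the output with its page number.
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)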
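
For the mixed case mentioned in the question (some pages have a real text layer, others are just images), one possible variation is to OCR only the pages that need it. This is a rough sketch, not a tested pipeline: it assumes pypdf as an extra dependency (not part of the snippet above), and the names `needs_ocr`, `extract_pdf_text` and the `min_chars` threshold of 20 are made up for illustration.

```python
def needs_ocr(embedded_text: str, min_chars: int = 20) -> bool:
    """Treat a page as scanned if its text layer is (almost) empty.

    min_chars is an arbitrary illustrative threshold, not a tuned value.
    """
    return len(embedded_text.strip()) < min_chars


def extract_pdf_text(path: str, dpi: int = 300) -> list[str]:
    """Return one string per page, falling back to OCR where needed."""
    # Imported here so needs_ocr() can be used without the OCR stack installed.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    results = []
    reader = PdfReader(path)
    for page_num, page in enumerate(reader.pages, start=1):
        # Try the embedded text layer first; it is faster and more exact than OCR.
        text = page.extract_text() or ""
        if needs_ocr(text):
            # Rasterise just this one page and run Tesseract on it.
            images = convert_from_path(
                path, dpi=dpi, first_page=page_num, last_page=page_num
            )
            text = pytesseract.image_to_string(images[0])
        results.append(text)
    return results
```

Calling `extract_pdf_text('k.pdf')` would then give a list with one entry per page, which can be joined or passed to a local model page by page.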
constantinum•9mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.
