frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•10mo ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•10mo ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•10mo ago
give https://pg.llmwhisperer.unstract.com/ a try

LinkedIn uses 2.4 GB RAM across two tabs

7•hrncode•2h ago•0 comments

Ask HN: Founders of estonian e-businesses – is it worth it?

164•udl•5d ago•107 comments

The risk of AI isn't making us lazy, but making "lazy" look productive

69•acmerfight•20h ago•74 comments

Ask HN: How are you keeping AI coding agents from burning money?

3•bhaviav100•11h ago•8 comments

Ask HN: What's the latest concensus on OpenAI vs. Anthropic $20/month tier?

6•whatarethembits•11h ago•4 comments

Ask HN: Is it just me?

11•twoelf•18h ago•14 comments

I built an AI that tailors your CV to every job in seconds

3•alebarbon•20h ago•2 comments

Ask HN: Anyone using Meshtastic/LoRa for non-chat applications?

11•redgridtactical•1d ago•0 comments

Claude API Error: 529

25•anujbans•2d ago•14 comments

Fear of Missing Code

7•lukol•1d ago•7 comments

Ask HN: Anybody tried to cheat AI-HR-system with hidden/white sentences?

4•KellyCriterion•1d ago•3 comments

Repsy – A lightweight, open-source alternative to Nexus/Artifactory

6•nuricanozturk•2d ago•0 comments

GitHub has been sending me an email every two seconds.

16•colonelspace•1d ago•3 comments

LLMs learn what programmers create, not how programmers work

41•noemit•5d ago•21 comments

Tell HN: Pangram is easily-defeatable with Claude

3•nunez•1d ago•4 comments

GitHub now requiring 2FA for all contributors,what authenticator apps you using?

13•nickcageinacage•2d ago•38 comments

Rses – cross-resume between Claude Code, Codex, and OpenCode

13•plawlost•3d ago•5 comments

Ask HN: How do you deal with obvious AI assistant usage in interviews

10•stackdestroyer•2d ago•19 comments

Ask HN: What do you use for normative specs to drive AI agents?

4•midnight_eclair•1d ago•4 comments

Lazy Tmux – Lazy-loading tmux sessions with a tree view

8•Alchemmist•4d ago•1 comments

Ask HN: Is anyone here also developing "perpetual AI psychosis" like Karpathy?

32•jawerty•5d ago•29 comments

Ask HN: Is using AI tooling for a PhD literature review dishonest?

10•latand6•5d ago•29 comments

You've reached the end!