Ask HN: Best on device LLM tooling for PDFs?

4•martinald•4mo ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which turned out to be the actual data.

The other major problem is that PDFs are sometimes actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.

Comments

raymond_goo•4mo ago
Try something like this:

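  # Notebook (Colab/Jupyter) setup; in a plain shell, drop the leading "!".
  !pip install pytesseract pdf2image pillow
  !apt install -y poppler-utils   # pdf2image needs Poppler to rasterize pages
  #!apt install -y tesseract-ocr  # uncomment if the tesseract binary is missing

  from pdf2image import convert_from_path
  import pytesseract

  # Render every page of the PDF to a PIL image at 300 DPI.
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label its text with the page number.
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)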
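Since some PDFs are text and some are really just images, a variation on the snippet above might read the embedded text layer first and only OCR the pages that come back (nearly) empty. This is a rough sketch, not a tested pipeline: it assumes the `pypdf` package (not part of the snippet above, installed with `pip install pypdf`), and the names `needs_ocr`, `format_pages`, `pdf_to_text` and the 20-character threshold are made up for illustration.

```python
"""Sketch: use a PDF's embedded text layer when present, OCR only image-only pages."""


def needs_ocr(extracted_text, min_chars=20):
    """Return True when a page's text layer is missing or too sparse to trust.

    min_chars is an arbitrary, illustrative threshold; tune it for your documents.
    """
    return len((extracted_text or "").strip()) < min_chars


def format_pages(page_texts):
    """Join per-page text with '--- Page N ---' separators (pages are 1-based)."""
    return "".join(
        f"\n--- Page {num} ---\n{text}"
        for num, text in enumerate(page_texts, start=1)
    )


def pdf_to_text(path, dpi=300, min_chars=20):
    """Extract text page by page, falling back to Tesseract for scanned pages."""
    # Imported lazily so the two helpers above work without the OCR stack installed.
    from pypdf import PdfReader
    from pdf2image import convert_from_path
    import pytesseract

    page_texts = []
    for num, page in enumerate(PdfReader(path).pages, start=1):
        text = page.extract_text() or ""
        if needs_ocr(text, min_chars):
            # Rasterize just this one page; pdf2image page numbers are 1-based.
            images = convert_from_path(path, dpi=dpi, first_page=num, last_page=num)
            text = pytesseract.image_to_string(images[0])
        page_texts.append(text)
    return format_pages(page_texts)
```

Calling `print(pdf_to_text('k.pdf'))` should give output in the same shape as the snippet above, while skipping OCR on pages that already carry real text. The result can then be passed to the local model as plain text instead of asking it to read the page images.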
constantinum•3mo ago
Give https://pg.llmwhisperer.unstract.com/ a try.
