frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

If every OS was open source, you could run vista

3•ms_menardi•3h ago•0 comments

Ask HN: Is anyone working at least 4 hours daily on an Apple Vision Pro?

141•widenrun•21h ago•95 comments

Ask HN: How do you model temporarily invalid data structures

2•escot•7h ago•1 comments

Sqlit – A lazygit-style TUI for SQL databases

19•MaxTeabag•1d ago•1 comments

We built a B2B API infrastructure for resume parsing and tailoring

3•engspeedy•13h ago•0 comments

Amdahl's Law for LLM generated code

3•akiarie•16h ago•1 comments

Did the Linux memory management maintainer "just quit"?

12•hliyan•1d ago•3 comments

Ask HN: When and why did you start believing in God?

7•dvrp•22h ago•35 comments

Ask HN: Why didn't the C64 come with Simons' BASIC in the box from 1983 onward?

15•amichail•2d ago•13 comments

Ask HN: How do you prepare to tech interview? Any tip and tricks?

16•DmitriyBuchilin•1d ago•19 comments

Ask HN: Did Messages get removed from Google Takeout?

22•MollyRealized•3d ago•1 comments

Ask HN: Why do we snore? And what to do about it?

19•simonebrunozzi•1d ago•17 comments

S there room for a VPN with zero Five Eyes servers and RAM-only infrastructure?

4•yanbinette•1d ago•2 comments

Ask HN: Shouldn't Google need to give a public statement about Railway incident?

180•srameshc•6d ago•105 comments

Ask HN: Best worldwide / classic phone games?

7•bix6•2d ago•7 comments

Ask HN: How to get back into programming without AI?

9•coolcoder613•1d ago•18 comments

Are Tech Meetups Dead?

15•mlinhares•2d ago•13 comments

Alternatives to HN for "tech outside of AI" discussion?

61•summonerOS•1w ago•37 comments

Ask HN: Are there any serious efforts to organize tech labor now?

40•0rganize•6d ago•29 comments

Ask HN: Is $300/HR too low these days for custom full stack?

20•noduerme•3d ago•15 comments

You've reached the end!