Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried PDFs with it locally (via LM Studio), but the results weren't as good as I had hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which was in fact the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused on those as well.

Given this is so new, I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this:

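  # Notebook cell: pdf2image needs poppler, pytesseract needs the tesseract binary
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr  # uncomment if tesseract is not already installed

  from pdf2image import convert_from_path
  import pytesseract

  # Render every PDF page to an image at 300 dpi
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label the output with its page number
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)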
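
For PDFs that mix real text with scanned pages (the "made up of images" case from the question), one possible extension is to run OCR only on pages whose embedded text layer is empty. This is a rough sketch, not a tested pipeline: `needs_ocr`, `merge_pages`, and the `min_chars` threshold of 20 are hypothetical names and values chosen for illustration, and the text-layer strings are assumed to come from whatever PDF text extractor you already use.

```python
from typing import Callable, List, Optional


def needs_ocr(text: Optional[str], min_chars: int = 20) -> bool:
    """Heuristic (assumption): a near-empty text layer suggests a scanned page."""
    return len((text or "").strip()) < min_chars


def merge_pages(
    embedded: List[Optional[str]],
    ocr_page: Callable[[int], str],
    min_chars: int = 20,
) -> str:
    """Combine all pages into one string, calling OCR only where needed.

    embedded: text-layer output for each page, in page order.
    ocr_page: callback taking a zero-based page index and returning OCR text.
    """
    parts = []
    for index, text in enumerate(embedded):
        if needs_ocr(text, min_chars):
            # Fall back to OCR for pages without usable embedded text
            text = ocr_page(index)
        parts.append(f"\n--- Page {index + 1} ---\n{(text or '').strip()}")
    return "".join(parts)
```

With the snippet above, `ocr_page` could be something like `lambda i: pytesseract.image_to_string(pages[i])`, so pages that already carry a text layer skip the slower OCR step.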
constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.
