frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: What are tools you have made for yourself since the advent of AI?

350•aryamaan•18h ago•569 comments

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

200•mdni007•19h ago•180 comments

LeetCode is the best way to learn a new language

2•JasonHEIN•2h ago•2 comments

Ask HN: How to escalate a rejected Google extension?

20•modzu•19h ago•12 comments

HN seems dead compared to say 10-15 years ago

36•morpheos137•19h ago•38 comments

Ask HN: What was your "oh shit" moment with GenAI?

730•andrehacker•4d ago•1105 comments

Ask HN: How do you cope when your startup contracts?

13•jasonephraim•14h ago•8 comments

Ask HN: Where do you like to consume content anymore?

5•selectedambient•9h ago•6 comments

Ask HN: What's your favorite HN Recap like podcast?

5•randomor•1d ago•2 comments

Ask HN: Thoughts on Siri AI?

2•akashwadhwani35•3h ago•1 comments

Ask HN: So what happened to Facebook "localhost" tracking?

106•juliusceasar•5d ago•102 comments

Ask HN: Which companies gained a competitive edge purely via engineering?

5•j1000•22h ago•6 comments

Ask HN: Why won't you be replaced by AI?

8•atleastoptimal•17h ago•29 comments

Ask HN: Options for critical thinking and learning outside work?

5•hnthrow10282910•18h ago•4 comments

Ask HN: What works for cutting AI token costs?

5•leoncos•5h ago•2 comments

Ask HN: What is your (AI) dev tech stack / workflow?

165•dv35z•3d ago•134 comments

Ask HN: Why is the HN crowd so anti-AI?

455•Ekami•3d ago•755 comments

Ask HN: Is there any data on whether users prefer voice/chatbot experiences?

2•fnimick•17h ago•4 comments

Ask HN: How do PaaS hosting providers enforce user policy compliance?

2•iishanto•18h ago•0 comments

Ask HN: How are thinking efforts implemented?

26•simianwords•2d ago•17 comments

Ask HN: How do you find deep technical content?

39•f311a•5d ago•28 comments

If HN policy disallows AI comments, why is linking to AI generated content ok?

13•ekelsen•13h ago•9 comments

Ask HN: Open-Source Software in Aerospace Imaging?

6•lightedman•23h ago•0 comments

Ask HN: What is the AI setup for an experienced dev starting on a new project?

5•postexitus•23h ago•9 comments

Ask HN: What is happening with the Meta Ads dashboard?

4•ramon156•1d ago•0 comments

Ask HN: Gin rummy strategies

24•bix6•4d ago•4 comments

Ask HN: Does consciousness itself require memory?

4•modinfo•17h ago•5 comments

Ask HN: Do you also feel like KYC is turning you off to the concept of money

4•JumpinJack_Cash•12h ago•0 comments

Ask HN: Were CS profs right to look down on programming in light of modern AI?

6•amichail•2d ago•3 comments

Ask HN: How do I whitelist commands in Antigravity?

5•madprops•1d ago•0 comments