frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: Are you still using a Vision Pro?

132•y1n0•6h ago•160 comments

Ask HN: Why hasn't there been a real competitor to Ticketmaster yet?

242•mdni007•1d ago•221 comments

Ask HN: Favorite text heavy blogs the are a joy to read

23•joshmarinacci•5h ago•8 comments

Ask HN: What are tools you have made for yourself since the advent of AI?

413•aryamaan•1d ago•703 comments

Ask HN: Is software engineering still a good career choice for new students?

3•iliashad•2h ago•1 comments

Ask HN: Prediction for SpaceX IPO?

4•bix6•5h ago•6 comments

Ask HN: What was your "oh shit" moment with GenAI?

731•andrehacker•5d ago•1106 comments

Ask HN: How are you preserving your skills while using AI?

6•rdrmc•8h ago•4 comments

Ask HN: How to escalate a rejected Google extension?

23•modzu•1d ago•12 comments

Ask HN: So what happened to Facebook "localhost" tracking?

106•juliusceasar•5d ago•102 comments

Ask HN: How do you cope when your startup contracts?

14•jasonephraim•1d ago•13 comments

LeetCode is the best way to learn a new language

2•JasonHEIN•14h ago•2 comments

Authorization via Gmail and Apple ID Banned in Russia

8•levleontiev•6h ago•1 comments

Ask HN: What's your favorite HN Recap like podcast?

5•randomor•2d ago•2 comments

Ask HN: Which companies gained a competitive edge purely via engineering?

5•j1000•1d ago•7 comments

Ask HN: What is your (AI) dev tech stack / workflow?

165•dv35z•4d ago•134 comments

Ask HN: Why is the HN crowd so anti-AI?

457•Ekami•3d ago•756 comments

Ask HN: Why won't you be replaced by AI?

10•atleastoptimal•1d ago•37 comments

Ask HN: Options for critical thinking and learning outside work?

6•hnthrow10282910•1d ago•5 comments

Ask HN: Where do you like to consume content anymore?

6•selectedambient•22h ago•9 comments

Ask HN: How do you find deep technical content?

40•f311a•5d ago•28 comments

Ask HN: How are thinking efforts implemented?

26•simianwords•2d ago•17 comments

Ask HN: Gin rummy strategies

24•bix6•5d ago•4 comments

Ask HN: Is there any data on whether users prefer voice/chatbot experiences?

2•fnimick•1d ago•4 comments

Ask HN: How do PaaS hosting providers enforce user policy compliance?

2•iishanto•1d ago•0 comments

Ask HN: What is happening with the Meta Ads dashboard?

4•ramon156•2d ago•0 comments

Ask HN: Were CS profs right to look down on programming in light of modern AI?

6•amichail•3d ago•3 comments

Ask HN: Open-Source Software in Aerospace Imaging?

6•lightedman•1d ago•0 comments

Ask HN: What is the AI setup for an experienced dev starting on a new project?

5•postexitus•1d ago•9 comments

Bad MCP design costs your agent 5x more tokens

15•JohnnyZhang483•4d ago•1 comments