Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs.

Now that llama.cpp has vision support, I tried out PDFs with it locally (via LM Studio), but the results weren't as good as I'd hoped. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like, which was the actual data.

The other major problem is that some PDFs are actually made up of images, and it got super confused by those as well.

Given this is so new, I'm struggling to find any tools that make this easier.
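
For the image-only PDF problem, one possible approach is to check each page's embedded text layer first and only send pages with little or no text to OCR or a vision model. This is a minimal sketch, not a tested pipeline: it assumes the `pypdf` package is installed, and `needs_ocr`, `extract_text_layers`, `pages_needing_ocr` and the `min_chars` threshold are hypothetical names and values chosen for illustration.

```python
def needs_ocr(page_text, min_chars=20):
    """Heuristic: a page with a (nearly) empty text layer is probably a scanned image.

    min_chars is an arbitrary threshold picked for illustration; tune it per corpus.
    """
    return len((page_text or "").strip()) < min_chars


def extract_text_layers(pdf_path):
    """Return the embedded text of every page ("" when a page has none)."""
    # Assumption: pypdf is installed (pip install pypdf). Imported lazily so the
    # pure-Python helpers in this sketch work without it.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    return [page.extract_text() or "" for page in reader.pages]


def pages_needing_ocr(page_texts, min_chars=20):
    """Return 1-based page numbers whose text layer looks too thin to trust."""
    return [
        page_num
        for page_num, text in enumerate(page_texts, start=1)
        if needs_ocr(text, min_chars)
    ]
```

Usage would be something like `pages_needing_ocr(extract_text_layers('k.pdf'))`. Pages not in the returned list can go to the model as plain text, which sidesteps the vision path entirely for PDFs that already carry real text.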

Comments

raymond_goo•1y ago
Try something like this:

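  # Notebook-style setup (Colab/Jupyter); run the same commands in a shell otherwise
  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils  # pdf2image needs poppler to render pages
  # Uncomment if the tesseract binary is not already installed (pytesseract needs it)
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  # Render every PDF page to an image
  pages = convert_from_path('k.pdf', dpi=300)

  # OCR each page and label it with its page number
  chunks = []
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      chunks.append(f"\n--- Page {page_num} ---\n{text}")

  all_text = "".join(chunks)
  print(all_text)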
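
If the OCR output is going into a local model with a small context window, the page markers in `all_text` can be used to split it back into pages and group them into batches. A rough sketch under stated assumptions: `split_pages`, `batch_pages` and `max_chars` are hypothetical names, and a character budget is only a crude stand-in for a real token count.

```python
import re

# Matches the "\n--- Page N ---\n" markers written by the OCR loop above.
PAGE_MARKER = re.compile(r"\n--- Page (\d+) ---\n")


def split_pages(all_text):
    """Turn the marker-delimited OCR output back into (page_number, text) pairs."""
    # With one capture group, re.split returns [prefix, "1", text1, "2", text2, ...].
    parts = PAGE_MARKER.split(all_text)
    return [(int(parts[i]), parts[i + 1]) for i in range(1, len(parts) - 1, 2)]


def batch_pages(pages, max_chars=6000):
    """Group consecutive pages so each batch stays within a character budget.

    max_chars is an illustrative value; a single page longer than the budget
    still gets a batch of its own rather than being cut.
    """
    batches, current, size = [], [], 0
    for page_num, text in pages:
        # Start a new batch when adding this page would exceed the budget.
        if current and size + len(text) > max_chars:
            batches.append(current)
            current, size = [], 0
        current.append((page_num, text))
        size += len(text)
    if current:
        batches.append(current)
    return batches
```

Each batch can then be sent as its own prompt, with the page numbers kept so answers can be traced back to the source pages.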
constantinum•1y ago
Give https://pg.llmwhisperer.unstract.com/ a try.
