frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

4•martinald•1y ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•1y ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)
constantinum•1y ago
give https://pg.llmwhisperer.unstract.com/ a try

Ask HN: Why didn't the C64 come with Simons' BASIC in the box from 1983 onward?

7•amichail•4h ago•7 comments

Ask HN: Best worldwide / classic phone games?

4•bix6•9h ago•4 comments

Ask HN: Did Messages get removed from Google Takeout?

19•MollyRealized•23h ago•1 comments

Is a Claw driven Hacker News user a problem?

6•delichon•9h ago•9 comments

Are Tech Meetups Dead?

8•mlinhares•11h ago•10 comments

$100 to upgrade Fresh IDE for ePub TUI reading

3•jph•12h ago•4 comments

Ask HN: Does anyone what a "RiotCache.dat" file was doing in my EFI partition?

5•kromerless•1d ago•7 comments

The solution the supply chain problems is removing your deps from .gitignore

8•SchizoDuckie•23h ago•3 comments

Ask HN: Is $300/HR too low these days for custom full stack?

15•noduerme•1d ago•11 comments

Ask HN: How to learn how to develop real time applications?

5•creatorcoder•1d ago•6 comments

Ask HN: Shouldn't Google need to give a public statement about Railway incident?

177•srameshc•4d ago•105 comments

Using games/cards to learn new skills

5•dominikz•1d ago•0 comments

Alternatives to HN for "tech outside of AI" discussion?

59•summonerOS•4d ago•35 comments

Ask HN: Do you have a colophon for your personal website?

8•susam•1d ago•5 comments

Ask HN: Are there any serious efforts to organize tech labor now?

36•0rganize•4d ago•28 comments

We were building infra for OpenClaw, and today I just tried Hermes and holy shit

5•Stanlyya•1d ago•1 comments

Ask HN: Failing interviews for mid-level SWE in UK, advice please

19•mjb8086•3d ago•12 comments

Ask HN: How much AI is in your writing?

9•js98•2d ago•30 comments

Ask HN: OpenAI, SpaceX/xAI, Anthropic all to IPO, is this a sign of the peak?

8•virgildotcodes•2d ago•10 comments

Tell HN: I went to Alaska's northernmost town and this was the GeoIP location

5•ironmagma•2d ago•0 comments

Ask HN: How to increase depth instead of breadth as 10 yoe as swe?

8•Cheesebh•1d ago•6 comments

Tell HN: Google banned Railway's account. Everything down

32•sergiotapia•5d ago•18 comments

Valgrind-3.27.1 Is Available

12•paulf38•3d ago•1 comments

Ask HN: What to learn and do, that makes me least affected by AI in STEM?

7•s3arch•1d ago•6 comments

Ask HN: What's New in Crypto?

5•kaycebasques•2d ago•11 comments

"Subligence" – proposed coinage for LLM "intelligence"

4•akiarie•2d ago•5 comments

You've reached the end!