frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: Best on device LLM tooling for PDFs?

1•martinald•5h ago
I've got very used to using the "big" LLMs for analysing PDFs

Now llama.cpp has vision support; I tried out PDFs with it locally (via LM Studio) but the results weren't as good as I hoped for. One time it insisted it couldn't do "OCR", but gave me an example of what the data _could_ look like - which was the data.

The other major problem is sometimes PDFs are actually made up of images; and it got super confused on those as well.

Given this is so new I'm struggling to find any tools which make this easier.

Comments

raymond_goo•5h ago
Try something like this

  !pip install pytesseract pdf2image pillow
  !apt install poppler-utils
  #!apt install tesseract-ocr
  from pdf2image import convert_from_path
  import pytesseract

  pages = convert_from_path('k.pdf', dpi=300)

  all_text = ""
  for page_num, img in enumerate(pages, start=1):
      text = pytesseract.image_to_string(img)
      all_text += f"\n--- Page {page_num} ---\n{text}"

  print(all_text)

Thailand to recriminalise cannabis as PM vows to get tough on drugs

https://www.abc.net.au/asia/thailand-to-recriminalise-cannabis/103846102
1•testrun•2m ago•0 comments

Show HN: Datetime.app – open-source alternative to time.is

https://datetime.app
1•Airyisland•3m ago•0 comments

America's College Towns Go from Boom to Bust

https://www.wsj.com/us-news/education/college-towns-economy-macomb-illinois-aae84dcc
1•JumpCrisscross•11m ago•0 comments

The principles of database design, or, the Truth is out there

https://ebellani.github.io/blog/2025/the-principles-of-database-design-or-the-truth-is-out-there/
1•b-man•11m ago•0 comments

Create a Cursor/Windsurf Clone in Python from Scratch (1-Hour Challenge) [video]

https://www.youtube.com/watch?v=zMnNVJkIf6I
2•marufm•17m ago•0 comments

Layers All the Way Down: The Untold Story of Shader Compilation

https://moonside.games/posts/layers-all-the-way-down/
1•birdculture•20m ago•0 comments

Async/Await versus the Calloop Model

https://notgull.net/calloop/
1•todsacerdoti•21m ago•0 comments

Small banks fuel revival in blank-cheque SPAC deals

https://www.ft.com/content/6eb4b7b6-af96-48a8-ac3a-cea61c5dc0ed
1•JumpCrisscross•22m ago•0 comments

Nasa Shares Secrets About Interiors of Moon, Vesta

https://www.nasa.gov/solar-system/asteroids/vesta/nasa-studies-reveal-hidden-secrets-about-interiors-of-moon-vesta/
1•colinprince•33m ago•0 comments

Centrist Dan wins Romanian presidency over hard-right pro-Trump rival

https://www.reuters.com/world/europe/romanians-vote-presidential-run-off-that-could-widen-eu-rifts-2025-05-17/
2•Bondi_Blue•35m ago•0 comments

Passkeys Debugger

https://www.passkeys-debugger.io
1•cypherpunks01•37m ago•0 comments

Comparison of Waymo Crash Rates to Human Benchmarks at 56.7M Miles

https://arxiv.org/abs/2505.01515
3•PaulHoule•38m ago•0 comments

Ask HN: Moving to London from California

2•siamese_puff•42m ago•0 comments

Apple boosts India's factory hopes – but a US-China deal could derail plans

https://www.bbc.com/news/articles/cly34p1jwvgo
3•tartoran•46m ago•0 comments

OpenAI's planned data center in Abu Dhabi would be bigger than Monaco

https://techcrunch.com/2025/05/16/openais-planned-data-center-in-abu-dhabi-would-be-bigger-than-monaco/
1•bookofjoe•49m ago•0 comments

What we in the open world are messing up in trying to compete with big tech

https://berthub.eu/articles/posts/what-the-open-world-must-do-better/
2•pabs3•52m ago•0 comments

Poireau: A Sampling Allocation Debugger

https://github.com/backtrace-labs/poireau
2•luu•59m ago•0 comments

Anthropic blames ClaudeAI for embarrassing unintentional mistake in legal filing

https://www.theverge.com/news/668315/anthropic-claude-legal-filing-citation-error
3•croes•1h ago•1 comments

Home – The Cozy CMS

https://home.bearcove.eu/
2•japrozs•1h ago•1 comments

Breaking writer's block: AI character profile generation in seconds

https://characterheadcanongen.com/
1•MikeHalloween•1h ago•1 comments

Show HN: Grace – Orchestrate Hybrid Mainframe and Cloud Workflows with YAML

https://graceinfra.org
2•arnavsurve•1h ago•0 comments

We sold coffee from the terminal [video]

https://www.youtube.com/watch?v=POlZS8PcyZw
1•randomcatuser•1h ago•1 comments

The Utter Flimsiness of XAI's Processes

https://smol.news/p/the-utter-flimsiness-of-xais-processes
3•pavel_lishin•1h ago•0 comments

An Asia Internet History: First Decade (1980-1990)

https://sites.google.com/site/internethistoryasia/book1
1•todsacerdoti•1h ago•0 comments

Letting the AIs Judge Themselves: A One Creative Prompt: The Coffee-Ground Test

https://tryaii.com/blog/llms-self-scoring-coffee-benchmark
1•tamtampo•1h ago•2 comments

Good Design Comes from Looking, Great Design Comes from Looking Away

https://www.chrbutler.com/good-design-comes-from-looking-great-design-comes-from-looking-away
1•PStamatiou•1h ago•0 comments

Photon Emission(2017)

https://www.physicsbook.gatech.edu/Photon_Emission
1•rolph•1h ago•0 comments

Show HN: Dub your videos with a few clicks

https://dublab.app
1•behramcelen•1h ago•0 comments

'Robocake' includes edible chocolate batteries

https://www.cnn.com/2025/05/16/science/video/robocake-edible-battery-digvid
1•dabinat•1h ago•0 comments

I Designed a Thinking Machine: A New Blueprint for Human-Like AGI [video]

https://www.youtube.com/watch?v=niMYHWpJITA
1•derekv123•1h ago•1 comments