frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I just built a scanned PDF text extractor for public PDFs (1-300 page)

https://readplace.com/view
1•fagnerbrack•46m ago

Comments

fagnerbrack•46m ago
For comparison: Claude only uses OCR for the first 100 pages, then falls back to text-only extract. Public URL in, HTML page out, AI throughout up to 300 pages (spartaaaaa!).

Conveniently, that's also roughly where the cost math stops working for a free tool. Scanned PDFs are best-effort OCR. Multi-page tables spanning sheets are still a weak spot.

Here's a link you can check:

https://people.math.harvard.edu/~ctm/home/text/others/shanno...

Feel free to try with your own PDF links to see what breaks, it will help improving the crawl logic and the parser (I still need to get some rate limits up)

Movesia – Control the Unity Editor with Plain English Commands

https://movesia.com
1•DexOmg•18s ago•0 comments

Distribution Fine Tuning: A post-training step to make models write better

https://dft.rosmine.ai/
1•sgt•45s ago•0 comments

Volo: Flight tracker built for group travel, delays, gates, and real connections

https://apps.apple.com/us/app/volo-flight-tracking/id6756634383
1•darpanypatel1•1m ago•0 comments

Built a subscription tracker that uses email forwarding instead of bank linking

https://www.subalert.org
1•momolii•2m ago•1 comments

Show HN: AI readiness toolkit: AI two-minute CODE maturity check

https://github.com/bjcoombs/ai-native-toolkit
1•ben30•3m ago•0 comments

Schlitz beer ending production after 177 years

https://www.boston25news.com/news/trending/schlitz-beer-ending-production-after-177-years/ABPDGCI...
1•1vuio0pswjnm7•4m ago•0 comments

JEP Draft: Enhanced Local Variable Declarations (Preview)

https://openjdk.org/jeps/8357464
1•theanonymousone•6m ago•0 comments

The XGO Programming Language

https://github.com/goplus/xgo
1•quux0r•7m ago•0 comments

The Founder Mental Health Survey

https://www.foundermental.health/
2•jasonshen•10m ago•0 comments

Event-Sourced Domain Modeling

https://www.esdm.io/
4•goloroden•11m ago•0 comments

Socket raises $60M Series C at $1B valuation

https://socket.dev/blog/series-c
2•slymax•12m ago•0 comments

Children in Australia tell UK kids how a social media ban has affected them

https://www.bbc.co.uk/newsround/videos/cx21x5x05wno
3•DropDead•12m ago•0 comments

AI token streaming isn't about SSE vs. WebSockets

https://zknill.io/posts/ai-token-streaming-isnt-about-sse-vs-websockets/
1•zknill•12m ago•0 comments

Exploring Ref Qualifiers in C++

https://www.meetingcpp.com/blog/items/Exploring-ref-qualifiers-in-Cpp.html
1•jandeboevrie•14m ago•0 comments

Claw Patrol: an open-source security firewall for agents

https://deno.com/blog/clawpatrol
1•slymax•15m ago•0 comments

Apple Music's AI stance: "Technology should amplify artists, not replace them"

https://9to5mac.com/2026/05/20/apple-music-shares-what-it-is-doing-to-keep-music-fair-in-an-ai-wo...
1•mgh2•15m ago•0 comments

Just You Wait

https://shiflett.org/blog/2026/just-you-wait
3•speckx•17m ago•0 comments

ATmosphere 1.0.0 – Liftoff – ActivityPub for WordPress

https://activitypub.blog/2026/05/20/atmosphere-1-0-0-liftoff/
1•Tomte•17m ago•0 comments

The Trailer Gap

https://maxmautner.com/2026/05/19/the-trailer-gap.html
1•mslate•19m ago•0 comments

Show HN: Patchmark – LSP for reviewing code changes/diffs in text

https://radicle.network/nodes/radicle.dpc.pw/rad%3Az3sP3WnHgo1UfwmfmFM9a5cZSSEZR
1•dpc_01234•20m ago•0 comments

FOSS Weekly #26.21: Microsoft's Distro, Bitwarden Drama, and More

https://itsfoss.com/newsletter/foss-weekly-26-21/
3•abdelhousni•20m ago•2 comments

The Day the Food Noise Died

https://www.nytimes.com/2026/04/27/health/food-noise-obesity-drugs-glp-1.html
1•paulpauper•21m ago•0 comments

Are there any good archivers that work?

1•paulpauper•21m ago•1 comments

Serving Netflix Video Traffic at 400Gb/S and Beyond (2022) [pdf]

https://nabstreamingsummit.com/wp-content/uploads/2022/05/2022-Streaming-Summit-Netflix.pdf
3•tosh•23m ago•0 comments

Harvard Caps A's as Selective Colleges Attack Grade Inflation

https://www.nytimes.com/2026/05/20/us/harvard-grade-inflation.html
1•paulpauper•23m ago•0 comments

"No way to prevent this" say users of only language where this regularly happens

https://xeiaso.net/shitposts/no-way-to-prevent-this/CVE-2026-45584/
4•speckx•24m ago•0 comments

The Trust and Safety Professional Association and Trust and Safety Foundation

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6579678
2•hn_acker•25m ago•1 comments

Assay – validation layer for AI agents that touch money

https://github.com/VenturFlow/Assay
1•venturflow•25m ago•0 comments

Tanya Janca on AI Slop, Vibe Coding, & the Future of AppSec

https://redmonk.com/videos/tanya-janca/
1•mooreds•26m ago•0 comments

Newsmoji. Why read when you don't have to?

https://github.com/qarl/newsmoji
1•qarl•27m ago•0 comments