frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Type Chart Calculator – Interactive type effectiveness tool

https://www.typematchup.org/
1•lincyang•30s ago•0 comments

Software engineering when machine writes the code

https://www.shayon.dev/post/2026/19/software-engineering-when-the-machine-writes-code/
1•shayonj•32s ago•0 comments

Cloudflare Workers performance: an experiment with Astro and worldwide latencies

https://blog.angelside.net/cloudflare-workers-performance-an-experiment-with-astro-and-worldwide-...
1•dagnelies•2m ago•0 comments

Claude Code configured the DNS for this website

https://rubenflamshepherd.com/articles/2026-01-20-claude-code-configured-the-dns
1•rubenflamshep•2m ago•0 comments

AGP 9.0 is out, and it's a disaster. Here's a full migration guide

https://old.reddit.com/r/androiddev/comments/1qi110y/agp_90_is_out_and_its_a_disaster_heres_full/
1•phreack•4m ago•0 comments

What chat app does Trump use?

1•sam_lowry_•4m ago•0 comments

Show HN: PIICloak – Open-source PII detection API (31 entity types, self-hosted)

https://github.com/dimanjet/piicloak
1•dimanjet•4m ago•0 comments

Do World Cup teams need a 50% prize money hike after tickets furore?

https://www.theguardian.com/football/2025/dec/18/fifa-world-cup-2026-ticket-prices-prize-money
1•PaulHoule•5m ago•0 comments

Ask HN: How Do You Find Interesting GitHub Projects and Repositories?

1•karakoram•5m ago•0 comments

Book Towns Are Made for Book Lovers

https://www.atlasobscura.com/articles/what-is-a-book-town
1•Brajeshwar•6m ago•0 comments

Toxic Hydrogen Cyanide and Its Role in the Origins of Life

https://www.universetoday.com/articles/toxic-hydrogen-cyanide-and-its-role-in-the-origins-of-life
1•Brajeshwar•6m ago•0 comments

Observing the positronium beam as a quantum matter wave

https://phys.org/news/2026-01-positronium-quantum.html
1•Brajeshwar•6m ago•0 comments

2025 Prize in the Mathematics of Artificial Intelligence

https://amathr.org/prizes/aiprize25/
1•simonpure•6m ago•0 comments

Show HN: Git analytics that works across GitHub, GitLab, and Bitbucket

2•inferno22•8m ago•0 comments

Show HN: A C library written in Rust for querying kernel configuration file

https://github.com/synalice/kconfq
1•synalice•8m ago•0 comments

Assert your way to stronger technical writing

https://fogknife.com/2025-04-04-assert-your-way-to-stronger-technical-writing.html
1•wonger_•9m ago•0 comments

Ask HN: What are the Recommender Systems papers from 2024-2025?

1•haensi•9m ago•0 comments

Open Responses

https://www.openresponses.org/
1•jonbaer•11m ago•0 comments

TopicRadar – Track trending topics across Hacker News, GitHub, ArXiv, and more

https://apify.com/mick-johnson/topic-radar
1•MickolasJae•11m ago•1 comments

Unified API for All TTS Models. Who's In?

1•akshat77•12m ago•0 comments

The Complete Guide to Claude.md

https://www.builder.io/blog/claude-md-guide
1•speckx•12m ago•0 comments

A ten-year review of the Cambridge Cybercrime Centre (2025) [pdf]

https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-1003.pdf
1•mooreds•12m ago•0 comments

How to Create a Plug-In, in Ada

https://old.reddit.com/r/ada/comments/1qhzwzz/how_to_create_a_plugin/~
2•ajdude•12m ago•0 comments

Tech titans lined up for Trump's second inauguration. Now they're even richer

https://www.ft.com/content/674b700e-765d-44e0-ba30-13b0c6c5abf1
3•Noaidi•12m ago•0 comments

Covid vaccination and post-infection cancer signals [pdf]

https://brownstone.org/wp-content/uploads/2026/01/oncotarget-26-049705-PUBLISHED-2.pdf
1•bitcoin_anon•13m ago•0 comments

The USA Lock-In: When Tech Dependency Becomes Geopolitical Vulnerability

https://blog-e530b5.gitlab.io/posts/usa-lock-in/
2•robtherobber•14m ago•0 comments

Python, Is It Being Killed by Incremental Improvements?

https://stefan-marr.de/2026/01/python-killed-by-incremental-improvements-questionmark/
1•matt_d•15m ago•0 comments

In Pursuit of Production Minimalism (2017)

https://brandur.org/minimalism
1•tosh•16m ago•0 comments

Show HN: Responsive Bento Grid implementation using Tailwind CSS (no heavy libs)

https://veloxweb.gumroad.com/l/launch-ui
1•asliper•16m ago•1 comments

Show HN: Claude Skill Editor

https://github.com/mtct/skill-editor
2•mtct88•18m ago•0 comments
Open in hackernews

Show HN: Ocrbase – pdf → .md/.json document OCR and structured extraction API

https://github.com/majcheradam/ocrbase
17•adammajcher•1h ago

Comments

mechazawa•49m ago
Is only bun supported or also regular node?
hersko•39m ago
I have a flow where i extract text from a pdf with pdf-parse and then feed that to an ai for data extraction. If that fails i convert it to a png and send the image for data extraction. This works very well and would presumably be far cheaper as i'm generally sending text to the model instead of relying on images. Isn't just sending the images for ocr significantly more expensive?
mimim1mi•12m ago
By definition, OCR means optical character recognition. It depends on the contents of the PDF what kind of extraction methodology can work. Often some available PDFs are just scans of printed documents or handwritten notes. If machine readable text is available your approach is great.
sgc•4m ago
How does this compare to dots.ocr? I got fantastic results when I tested dots.

https://github.com/rednote-hilab/dots.ocr