We built a benchmark to evaluate how well document parsers work on a dataset of 2,000 manually annotated PDFs, evaluating across multiple dimensions: charts, tables, text styling, text correctness, and attribution.
The benchmark evaluates performance on full pages (not selected parts of pages), and covers different OSS, frontier-model, and commercial approaches.
For transparency it is available as an HF leaderboard.
Yes, it evaluates parsing with frontier models from all 3 major providers (Google, Anthropic, and OpenAI). It is also easy to extend to evaluate new models (code/dataset is available); a rough sketch of what that could look like is below.
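A minimal, hypothetical sketch of plugging a new parser into a page-level evaluation loop, assuming the dataset is on the HF Hub; the dataset id, column names, and metric here are placeholders, not the benchmark's actual code:

```python
# Hypothetical sketch only: dataset id, column names, and the metric are
# placeholder assumptions, not the benchmark's real API.
from difflib import SequenceMatcher
from datasets import load_dataset  # pip install datasets

def my_parser(page_image) -> str:
    """Swap in the parser/model you want to benchmark; return parsed text."""
    raise NotImplementedError

def text_similarity(pred: str, ref: str) -> float:
    """Stand-in text-correctness score; the benchmark defines its own per-dimension metrics."""
    return SequenceMatcher(None, pred, ref).ratio()

ds = load_dataset("your-org/pdf-parsing-benchmark", split="test")  # placeholder id
scores = []
for example in ds:
    pred = my_parser(example["page_image"])  # parse the full page, not crops
    scores.append(text_similarity(pred, example["ground_truth"]))

print("mean text-correctness score:", sum(scores) / len(scores))
```

In the real benchmark, each annotated dimension (charts, tables, text styling, text correctness, attribution) would get its own metric rather than the single similarity score used above.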
pierre•1h ago
Paper: https://arxiv.org/abs/2604.08538