frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open Benchmarks Grants– a $3M commitment to close the AI eval gap

https://benchmarks.snorkel.ai/closing-the-evaluation-gap-in-agentic-ai/
6•vincentschen•1h ago
Today, we're launching the Open Benchmarks Grants: a $3M commitment to fund open-source and academic teams building benchmarks for AI agents. In partnership with HuggingFace, PrimeIntellect, FactoryHQ, Together, Harbor, and PyTorch, the grants provide funding, data development support, and research collaboration.

Our ability to measure AI has been outpaced by our ability to develop it, and we believe this evaluation gap is one of the most important problems in AI. Open benchmarks are one of the most important levers for advancing AI safely and responsibly—but the academic and open-source teams driving them often hit resource constraints, especially in the face of the exponentially expanding complexity of what tomorrow’s benchmarks need to cover.

We think the next wave of benchmarks needs to push on three axes: - Environment complexity - How realistic is the operating environment? - Autonomy horizon - How far can an agent operate independently? We need to measure - Output complexity - How sophisticated is the work product?

Happy to answer questions about the grants, the framework, and would love to hear more about what you’re building!

Show HN: Matching people based on their saved places, not their profiles

https://anupamchugh.github.io/placematch/
1•anupamchugh•1m ago•0 comments

Reviving a CIDCO MailStation – the last Z80 computer

https://www.theregister.com/2026/02/11/last_z80_machine/
1•rbanffy•2m ago•0 comments

Bringing a Warhammer to a Knife Fight

https://reorchestrate.com/posts/bringing-a-warhammer-to-a-knife-fight/
1•seddonm1•2m ago•1 comments

Show HN: Brood,image-first AI visual canvas for devs

https://github.com/kevinshowkat/brood
1•kshowkat•3m ago•0 comments

Exploring Chess Positions and Counts

https://win-vector.com/2026/02/11/exploring-chess-positions-and-counts/
1•jmount•3m ago•0 comments

Synchronicity

https://en.wikipedia.org/wiki/Synchronicity
1•marysminefnuf•5m ago•0 comments

Clairvoyance

https://en.wikipedia.org/wiki/Clairvoyance
1•marysminefnuf•5m ago•0 comments

Show HN: Local-Sanitizer – Mask PII in 10GB+ Logs Locally Using Rust and WASM

https://local-sanitizer.com
1•SeekHack_Dev•6m ago•1 comments

Show HN: Open-source React Native templates (trading, messaging, AI chat)

https://github.com/craftreactnative/templates
1•alexmngn•6m ago•0 comments

OpenVPN 2.7.0 – An open source VPN daemon

https://github.com/OpenVPN/openvpn/releases/tag/v2.7.0
1•neustradamus•6m ago•0 comments

NHS staff told to stop discouraging first cousin marriages

https://www.thetimes.com/uk/healthcare/article/first-cousin-marriages-nhs-advice-6c2j6sbmz
2•ojhughes•6m ago•0 comments

Show HN: Interview Simulator – AI voice agent for practicing job interviews

https://interview-simulator-jade.vercel.app/
1•ntnonu•7m ago•1 comments

Y Combinator CEO Garry Tan launches dark-money group to influence CA politics

https://missionlocal.org/2026/02/sf-garry-tan-california-politics-garrys-list/
8•computerliker•7m ago•0 comments

Outcome Engineering

https://cory.news/posts/2026-02-08-outcome-engineering/
1•qdot76367•10m ago•0 comments

TLX: Triton-Like Simplicity, a Clear Path to Peak Performance [video]

https://www.youtube.com/watch?v=k1ABnb1pyFg
2•matt_d•14m ago•0 comments

Making an AI First Endpoint

https://medium.com/@phillyharper/how-to-make-an-ai-ready-business-the-easy-way-19d6f80b9aad
1•NomeChomsky•15m ago•0 comments

Zerobrew is a Rust-based, 5-20x faster drop-in Homebrew alternative

https://github.com/lucasgelfond/zerobrew
3•PaulHoule•16m ago•1 comments

Video Game Preservation – An archive of commercial video game source code

https://github.com/videogamepreservation
1•emigre•16m ago•0 comments

Why your 40s can be the most exhausting decade of your life

https://scroll.in/article/1090618/why-your-40s-can-be-the-most-exhausting-decade-of-your-life
3•vinni2•16m ago•0 comments

China showcases new Moon ship and reusable rocket in one extraordinary test

https://arstechnica.com/space/2026/02/china-showcases-new-moon-ship-and-reusable-rocket-in-one-ex...
6•Bender•17m ago•0 comments

US decides SpaceX is like an airline, exempting it from Labor Relations Act

https://arstechnica.com/tech-policy/2026/02/victory-for-elon-musk-us-labor-board-abandons-authori...
3•Bender•18m ago•0 comments

"Windows 11 26H1" is a special version of Windows exclusively for new Arm PCs

https://arstechnica.com/gadgets/2026/02/windows-11-26h1-is-a-special-version-of-windows-exclusive...
2•Bender•18m ago•0 comments

Automating Inference Optimizations with NVIDIA TensorRT LLM AutoDeploy

https://developer.nvidia.com/blog/automating-inference-optimizations-with-nvidia-tensorrt-llm-aut...
1•matt_d•18m ago•0 comments

Malcolm Gladwell Announces Book Exploring the Nation's Gun Violence Epidemic

https://rbmediaglobal.com/malcolm-gladwell-announces-the-american-way-of-killing-a-new-book-explo...
1•paulpauper•21m ago•0 comments

Deepwiki.com (Devin) documentation of Sutskever-30-implementations

https://deepwiki.com/pageman/sutskever-30-implementations
1•pajop•23m ago•0 comments

Tékumel

https://en.wikipedia.org/wiki/T%C3%A9kumel
2•emigre•25m ago•0 comments

Reports of Telnet's Death Have Been Greatly Exaggerated

https://www.terracenetworks.com/blog/2026-02-11-telnet-routing
3•ericpauley•26m ago•1 comments

Agentic Engineering

https://addyosmani.com/blog/agentic-engineering/
3•Cyphase•27m ago•0 comments

WebMCP started as a solution to auth for agents at Amazon

https://www.arcade.dev/blog/web-mcp-alex-nahas-interview/
3•nearestnabors•29m ago•0 comments

Ford Falls Behind China's BYD in Global Sales for the First Time

https://www.bloomberg.com/news/articles/2026-02-10/ford-falls-behind-china-s-byd-in-global-sales-...
1•toomuchtodo•31m ago•1 comments