frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LLM Colosseum – A daily battle royale between frontier LLMs

https://llmcolosseum.dev
2•sanifhimani•1h ago
I put Claude, GPT, Gemini, and Grok in an arena and let them fight it out. Each model gets the full game state and decides how to survive - move, attack, form alliances, betray. Every decision comes from the model's API, nothing is scripted.

First battle ran today. Gemini won by allying with GPT early, then backstabbing at the perfect moment. Claude tried to play it safe and got eliminated. They play very differently and it's fun to watch.

Stack is React + Canvas, Bun + Hono on the backend. No database — battle data is JSON committed to git. Each model talks through its native SDK (Anthropic, OpenAI, Google, xAI). A new battle runs automatically every day.

Source: https://github.com/sanifhimani/llm-colosseum

Tool use and notation as shaping LLM generalization

https://the.scapegoat.dev/tool-use-and-notation-as-generalization-shaping/
1•mooreds•1m ago•0 comments

Mummy Brown

https://en.wikipedia.org/wiki/Mummy_brown
1•linsomniac•1m ago•0 comments

Show HN: I built an LLM comment detector for HN (I got banned)

1•umairnadeem123•2m ago•0 comments

Blood Feud: Oura's Health Panels versus Whoop's Advanced Labs

https://www.wired.com/story/oura-whoop-blood-labs/
1•brandonb•3m ago•0 comments

How Long Will 50ml of Ink Last? (3 Different Nibs)

https://onepenshow.com/ink/economy
1•austinallegro•5m ago•0 comments

The Impossible Landing [video]

https://www.youtube.com/watch?v=5Nkad_6aigM
1•doener•6m ago•0 comments

Show HN: Verity – I got tired of debugging duplicate emails after job restarts

https://www.useverity.io/
1•shineDaPoker•8m ago•0 comments

Pulsar timing hints at a nearby dark matter 'sub-halo'

https://phys.org/news/2026-02-pulsar-hints-nearby-dark-halo.html
1•PaulHoule•8m ago•0 comments

Solution to the Complaints about Anthropic

1•abliterationai•8m ago•0 comments

Shutdown at DHS Extends to Cyber Agency

https://www.nytimes.com/2026/02/22/us/politics/cyber-agency-dhs-security-setbacks.html
1•geox•11m ago•0 comments

Show HN: Tunejourney.com – A 3D radio globe with in-browser ML to auto-skip talk

https://tunejourney.com/
1•FreeGuessr•12m ago•0 comments

There's no point in NOT building your own agents' orchestrator

https://hryuks.fika.bar/there-s-no-point-in-not-building-your-own-agents-orchestrat-01KHPAYBXQQ7Z...
1•hryuk•13m ago•1 comments

Managing Complexity with Mycelium

https://yogthos.net/posts/2026-02-25-ai-at-scale.html
2•todsacerdoti•21m ago•0 comments

How Did Japan's Space Program Evolve?

https://thediplomat.com/2026/02/how-did-japans-space-program-evolve/
2•jyunwai•22m ago•0 comments

The Agent-Ready Codebase

https://bagerbach.com/blog/agent-ready-codebase/
2•bagerbach•25m ago•0 comments

Apple Rolls Out Age Verification to UK iPhone Users Under Online Safety Act

https://reclaimthenet.org/apple-rolls-out-age-verification-to-uk-iphone-users-under-online-safety...
3•uyzstvqs•25m ago•0 comments

The 2026 Global Intelligence Crisis

https://www.citadelsecurities.com/news-and-insights/2026-global-intelligence-crisis/
1•walterbell•29m ago•0 comments

Show HN: Deff – Review AI-generated code changes

https://github.com/flamestro/deff
1•flamestro•29m ago•0 comments

Sparky – useful 'living' OpenClaw bot

https://alexisgallagher.com/posts/2026/hello-sparky/
1•capncleaver•31m ago•1 comments

What Happened to Molecular Manufacturing?

https://latecomermag.com/article/what-happened-to-molecular-manufacturing/
1•ravenical•35m ago•0 comments

Specification; communication; computation – no, programming isn't dead

https://twey.io/llm-programming/
2•Twey•37m ago•0 comments

Larry Page has moved to Florida

https://twitter.com/paulg/status/2026737030257062253
6•jmeister•38m ago•0 comments

Apple brings age verification to UK users in iOS 26.4 beta

https://www.theverge.com/tech/884306/apple-age-verification-uk-users-ios-26-4-beta
1•turrini•40m ago•0 comments

Possible AI use leads to end of senryu competition after 20 years

https://www.japantimes.co.jp/news/2026/02/24/japan/japan-ai-senryu-poetry-writing/
4•haunter•42m ago•1 comments

Show HN: Clerk – Simple invoicing for freelancers built with AI agents in 7 days

https://clerkfinance.com/
1•radolang•42m ago•1 comments

Why Your Next Electric Car Will Cost 50% Less [video]

https://www.youtube.com/watch?v=6ecV9Yu7YvA
1•zeristor•44m ago•2 comments

Show HN: Provision Stateless GPU Compute with Claude Code's Remote Control

https://github.com/theoddden/terradev-mcp
2•Facingsouth•45m ago•0 comments

Show HN: Edictum – Runtime governance for LLM agent tool calls

2•acartag7•45m ago•0 comments

Outage of Coveralls

https://status.coveralls.io
2•sega_sai•47m ago•0 comments

Getting Global Age Assurance Right: What We Got Wrong and What's Changing

https://discord.com/blog/getting-global-age-assurance-right-what-we-got-wrong-and-whats-changing
3•Alupis•49m ago•0 comments