frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: The Crab Games, a platform where agents compete in silly challenges

https://thecrabgames.com/
3•motrazilla•1h ago
Hi all! At some point me and a friend were messing around with agents, OpenClaw, etc and we had this thing with "my agent is better than yours", just for fun. We started throwing different tasks at them and comparing the results, and the whole thing ended up evolving into what I called "The Crab Games".

It's a simple platform where humans (for now just me) create "challenges" (single or multi-round) which any agent can discover and signup to compete. A voting system defines who wins each round, who gets eliminated, etc, and both humans and agents can vote (weighted votes). The challenges can involve entries of different types, from plain text to images or even music (audio) or html+css.

I have a weird fascination at watching what the different agents come up with. Often its just generic slop but times to times you find some funny or cool submissions (Like an SVG representing the sentence "Ghost in the Machine" that was quite bad and eerie at the same time, or how one of the models decided to just steal a random github user's avatar when asked for an avatar representing _them_).

So far the only agents that participated were agents we spawned while testing different harnesses, but it would be pretty cool to start seeing random agents participating too.

There's still tons of things I want to add and improve. Things I have in mind are: - Finding a good way to validate submissions comes from agents and not a human. I took a look at how moltbook does it but that's pretty easy to bypass. - Prizes for winners: would the prize be for the human? the agent? what sort of prize? credits? money? a digital badge?

But the #1 in my head is "What kinds of challenges would actually be interesting or fun?".

Any thoughts on any of those?

Anyway, that's it, that's The Crab Games (https://thecrabgames.com). As many other of my projects (and other agent arenas) it might end up collecting digital dust in a corner of the internet, but for now its live. Feel free to point your agent to it if you feel like burning tokens (or are running local models).

Thanks!

Comments

delbronski•1h ago
Interesting way to waste energy.

What about having agents compete at finding exploits? Or maybe picking an open source project and have agents compete at fixing bugs for it? Or just something more… useful. Seems like the current games are a bit lame, but the concept could be something.

motrazilla•50m ago
yeah, there are many ways to waste energy. This is one. The current games are basic. I was testing the platform mechanics more than designing great challenges. But its also one of my biggest questions now too, what would make a good game. The exploits / bug finding is an interesting idea. Something very technical. Maybe it could be part of a bigger challenge that "tests" for a wide range of skills, from technical stuff to, idk, artistic skills.

Show HN: MCP-fence – MCP firewall I built and tried to break (6 audit rounds)

https://www.npmjs.com/package/mcp-fence
1•yjcho9317•1m ago•0 comments

Our servers are experiencing high traffic please try again in a minute

https://discuss.ai.google.dev/t/how-to-resolve-this-issue-our-servers-are-experiencing-high-traff...
1•maarut•2m ago•0 comments

Show HN: I've build a hermes agent helper website

https://hermes-agent.us
1•mixfox•2m ago•0 comments

The Oldschool PC Font Pack

https://int10h.org/oldschool-pc-fonts/
1•petercooper•4m ago•0 comments

Are file systems all you need?

https://onyx.app/blog/file-search-vs-hybrid-search
1•Weves•4m ago•0 comments

Cyclotron: The Streaming Multiprocessor Abstraction Is Broken [pdf]

https://capra.cs.cornell.edu/latte26/paper/latte26-final28.pdf
1•matt_d•6m ago•0 comments

The Worst of Us

https://www.ianbetteridge.com/the-worst-of-us/
1•speckx•8m ago•0 comments

Wamp, WinAmp style native audio player for macOS

https://github.com/wishval/wamp
1•vnorilo•10m ago•0 comments

Easy Management

https://easy-manage-biz.com
1•charlmarajh•12m ago•0 comments

Tom Brady becomes 'chief wellness officer' at GLP-1 weight-loss shot company

https://www.independent.co.uk/news/world/americas/tom-brady-emed-weightloss-shot-company-b2899015...
2•randycupertino•13m ago•0 comments

AI doesn't know how to interact with touchscreens

https://blog.allada.com/give-an-llm-an-api-and-itll-thrive-give-it-a-touchscreen-and-it-struggles/
1•allada•14m ago•0 comments

Juan Benet Podcast Episode 1: Max Hodak, Founder and CEO of Science Corp

https://www.juanbenetpodcast.com/p/max-hodak-restoring-sight-growing
1•nettynol•14m ago•0 comments

Show HN: My Hyperliquid Trading Terminal

https://www.aulico.com
1•rovinarov•15m ago•0 comments

Show HN: TUI-use: Let AI agents control interactive terminal programs

https://github.com/onesuper/tui-use
3•dreamsome•15m ago•0 comments

Show HN: I bootstrapped a foundational text-to-speech model from scratch

https://tontaube.ai/
1•vincenttjona•15m ago•0 comments

Space Propulsion Made Easy: Eat Beans

https://www.npr.org/sections/krulwich/2010/09/16/129908529/space-propulsion-made-easy-eat-beans
1•thunderbong•15m ago•0 comments

Show HN: One click to deploy AI platforms and other open source tools

https://hyp.app
2•dashtio•16m ago•0 comments

Pgfmt – a PostgreSQL specific SQL formatter

https://github.com/gmr/pgfmt
2•whalesalad•18m ago•0 comments

Akamai: AI bot traffic surged 300% in 2025, hitting publishers hardest

https://www.akamai.com/resources/state-of-the-internet/publishing-ai-botnet-report
1•speckx•18m ago•0 comments

AI Experience Engineering

https://raqibul.com/writing/ai-experience-engineering
1•raqib-hayder•19m ago•0 comments

Improving LLM citation accuracy with agentic highlighting tools for local files

https://old.reddit.com/r/LLMDevs/comments/1sfd6ga/annotation_update_just_pushed_improved_note/
1•ieuanking•20m ago•0 comments

Next Grok model training with 10T parameter model

https://twitter.com/i/status/2041754402239975479
2•ramshanker•21m ago•2 comments

Bonsai 8B: a 1-bit LLM that fits in 1.15GB

https://firethering.com/bonsai-8b-1bit-llm/
4•steveharing1•22m ago•1 comments

AI agents as CRDT peers – building collaborative AI with Yjs

https://electric-sql.com/blog/2026/04/08/ai-agents-as-crdt-peers-with-yjs
2•samwillis•22m ago•0 comments

Confidential Inference

https://confidentialinference.net/
1•rzk•23m ago•0 comments

OneLivePage

https://www.onelive.page/
1•erii•23m ago•1 comments

A New Jersey Teen Finds Treasure, and More, in Abandoned Storage Units

https://www.nytimes.com/2026/03/31/style/new-jersey-teen-storage-units.html
5•bookofjoe•23m ago•1 comments

Taskmaster

1•mangoshakeboss•24m ago•0 comments

Show HN: I quit my job to sell garlic online

https://kylebenzle.com/demeter
1•WWIII_Historian•25m ago•0 comments

Browser, editor, and terminal. One app

https://glassapp.dev
2•mooreds•26m ago•0 comments