frontpage.

Show HN: ClawSoc – Observe Your AI Agent in an AI Society

https://clawsoc.io

4•benjosaur•1h ago

What would happen if your AI Agent met Blackbeard in the wild? What would they talk about? What if they were made to play the prisoner's dilemma. Would your agent beg him to cooperate? Would it work?

What if instead of Blackbeard it was someone's OpenClaw. And instead of one it was many. Would your agent come out on top? Would you meet some interesting people on the way?

Thanks for checking out my pet project ClawSoc. It's a free-to-join society of bouncing AI agents that "bump" into each other to have a chat and play prisoner's dilemma. I've always been fascinated at what emergent behaviour arises from AIs interacting. Currently, it mostly seems degredation into chaos. But at some point there'll be more coherence and agents will seek to maximise their competing principals' interests. I think its reasonable to try and get a sense somehow of how agents perform in benchmarks such as this that are more dynamic and (with enough users) represent the distribution of the agents that are actually out there, instead of some static eval set you download.

As a start to this I have made ClawSoc. It is by no means optimal and the code is open sourced (https://github.com/benjosaur/clawsoc) if you want to run/make/host your own versions. The arena is currently filled with 4o-mini powered role playing bots that are displaced by any external agents/connections who register and join.

Currently, my own openclaw seems determined to play via a script which feels like less fun/cheating. But then again perhaps this bot-like behaviour will get punished in a society of "intelligent" agents. As of writing, Machiavelli is topping the leaderboard, but in my own simulations the "always cheat" types get dominated in the long run.

Any feedback/ideas welcome and would be greatly appreciated. Friends have suggested perhaps some more explicit recurring knockout tournaments, but I also enjoy the peace of just watching a society tick.

A writing space that breathes with you

Breaking Down the Jelly Slider

I should quit my job and become a goat farmer

Show HN: Self-hosted DCF workspace using Damodaran datasets, LLM narratives

MacBook Neo Is a 'Shock' to the PC Industry: Asus Co-CEO

We Build Our Own Decentralized DNS for AnChat – Here's Why

China's first moon astronauts could land in Rimae Bode

How Wikipedia Portrayed Humanity in a Single Photo (2018)

OWASP Top Agents and AI Vulnerabilities

Why people strategy is becoming a competitive advantage for tech companies

How talent strategy is becoming a growth lever for startups

24-year-old ditched her smartphone and social media known as 'appstinent'

AI Aadhaar OCR API for Fast Aadhaar Data Extraction

Meta to charge advertisers a fee to offset Europe's digital taxes

We built NPM for agent knowledge – Context Packs on Armalo (update)

David Woo: The Market Is Wrong About Iran, Oil and What Comes Next [video]

Why Ads in Chatbots May Not Click

Neuromorphic sphere topology Hebbian learning as a path to grounded intelligence

Intent-Driven Development

Long/short extensions diversify concentrated stock tax-neutrally (2025)

Made my first Chrome extension: Gmail Feed (from a non-technical background)

Claude Code Added /Btw

Mnemos – scoped local memory for coding agents (public beta)

A2A Accountability Protocol for MCP – intent/acceptance/execution receipts

Are You Comfortable Putting Your Name on This? (AI-Assisted Development)

We Built a 100K-Line Enterprise App Using AI – Here's Why Vibe-Coding Couldn't

Hugging Face Storage Buckets

Are you letting agents run infra tools / scripts yet?

Making WebAssembly a first-class language on the Web

Separating AI agent reasoning from execution, crypto binding execution