frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Multi-Agent-Coder Is #12 on Stanford's TBench. Beats Claude Code

https://github.com/Danau5tin/multi-agent-coding-system
1•Danau5tin•3h ago
This weekend I built a multi-agent coding system which, quite unexpectedly, beat Claude Code on Stanford's Terminal Bench!

The architecture is straightforward, consisting of an orchestrator agent that deploys explorer & coder subagents to complete complex terminal based tasks, utilising an intelligent context sharing mechanism along the way which makes it all work.

The repo has a lot of technical details, and all the code and prompts for you to play around with if you'd like!

I had a lot of fun making this, I hope you have fun reading the README, using it yourself, or even extending it!

As always, a huge thanks to the great team behind Terminal Bench. It is a great benchmark.

Thanks for reading, Dan