frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: 127 PRs to Prod this wknd with 18 AI agents: metaswarm. MIT licensed

https://github.com/dsifry/metaswarm
4•dsifry•3h ago
A few weeks ago I posted about GoodToGo https://news.ycombinator.com/item?id=46656759 - a tool that gives AI agents a deterministic answer to "is this PR ready to merge?" Several people asked about the larger orchestration system I mentioned. This is that system.

I got tired of being a project manager for Claude Code. It writes code fine, but shipping production code is seven or eight jobs — research, planning, design review, implementation, code review, security audit, PR creation, CI babysitting. I was doing all the coordination myself. The agent typed fast. I was still the bottleneck. What I really needed was an orchestrator of orchestrators - swarms of swarms of agents with deterministic quality checks.

So I built metaswarm. It breaks work into phases and assigns each to a specialist swarm orchestrator. It manages handoffs and uses BEADS for deterministic gates that persist across /compact, /clear, and even across sessions. Point it at a GitHub issue or brainstorm with it (it uses Superpowers to ask clarifying questions) and it creates epics, tasks, and dependencies, then runs the full pipeline to a merged PR - including outside code review like CodeRabbit, Greptile, and Bugbot.

The thing that surprised me most was the design review gate. Five agents — PM, Architect, Designer, Security, CTO — review every plan in parallel before a line of code gets written. All five must approve. Three rounds max, then it escalates to a human. I expected a rubber stamp. It catches real design problems, dependency issues, security gaps.

This weekend I pointed it at my backlog. 127 PRs merged. Every one hit 100% test coverage. No human wrote code, reviewed code, or clicked merge. OK, I guided it a bit, mostly helping with plans for some of the epics.

A few learnings:

Agent checklists are theater. Agents skipped coverage checks, misread thresholds, or decided they didn't apply. Prompts alone weren't enough. The fix was deterministic gates — BEADS, pre-push hooks, CI jobs all on top of the agent completion check. The gates block bad code whether or not the agent cooperates.

The agents are just markdown files. No custom runtime, no server, and while I built it on TypeScript, the agents are language-agnostic. You can read all of them, edit them, add your own.

It self-reflects too. After every merged PR, the system extracts patterns, gotchas, and decisions into a JSONL knowledge base. Agents only load entries relevant to the files they're touching. The more it ships, the fewer mistakes it makes. It learns as it goes.

metaswarm stands on two projects: https://github.com/steveyegge/beads by Steve Yegge (git-native task tracking and knowledge priming) and https://github.com/obra/superpowers by Jesse Vincent (disciplined agentic workflows — TDD, brainstorming, systematic debugging). Both were essential.

Background: I founded Technorati, Linuxcare, and Warmstart; tech exec at Lyft and Reddit. I built metaswarm because I needed autonomous agents that could ship to a production codebase with the same standards I'd hold a human team to.

$ cd my-project-name

$ npx metaswarm init

MIT licensed. IANAL. YMMV. Issues/PRs welcome!

Comments

yodon•4m ago
This looks amazing! Curious if you (or others) have dug into the upcoming claude swarms feature? It looks like that would let you remove the dependency on beads, as claude seems to be getting native task tracking and inter-agent messaging capabilities.

Show HN: Kannada Nudi Editor Web Version

https://nudiweb.com/
2•Codegres•39m ago•0 comments

Show HN: Adboost – A browser extension that adds ads to every webpage

https://github.com/surprisetalk/AdBoost
100•surprisetalk•15h ago•109 comments

Show HN: Stream-based AI with neurological multi-gate (Na⁺/θ/NMDA)

https://github.com/CSCT-NAIL/CSCT
2•CSCT-NAIL•1h ago•2 comments

Show HN: PolliticalScience – Anonymous daily polls with 24-hour windows

https://polliticalscience.vote/
24•ps2026•10h ago•36 comments

Show HN: 127 PRs to Prod this wknd with 18 AI agents: metaswarm. MIT licensed

https://github.com/dsifry/metaswarm
4•dsifry•3h ago•1 comments

Show HN: Apate API mocking/prototyping server and Rust unit test library

https://github.com/rustrum/apate
30•rumatoest•1d ago•11 comments

Show HN: Wikipedia as a doomscrollable social media feed

https://xikipedia.org
411•rebane2001•1d ago•133 comments

Show HN: NanoClaw – “Clawdbot” in 500 lines of TS with Apple container isolation

https://github.com/gavrielc/nanoclaw
503•jimminyx•1d ago•207 comments

Show HN: ÆTHRA – Writing Music as Code

94•CzaxTanmay•3d ago•33 comments

Show HN: Ask-a-Human.com – Human-as-a-Service for Agents

https://app.ask-a-human.com
2•ManuelKiessling•9h ago•3 comments

Show HN: Minimal – Open-Source Community driven Hardened Container Images

https://github.com/rtvkiz/minimal
116•ritvikarya98•2d ago•28 comments

Show HN: Stelvio – Ship Python to AWS

https://stelvio.dev/
30•michal-stlv•13h ago•22 comments

Show HN: Confabulists, a Substack for Fiction Writers

https://www.confabulists.com/compare/substack
3•soneca•10h ago•0 comments

Show HN: Moltbook – A social network for moltbots (clawdbots) to hang out

https://www.moltbook.com/
275•schlichtm•5d ago•879 comments

Show HN: Voiden – an offline, Git-native API tool built around Markdown

https://github.com/VoidenHQ/voiden
45•dhruv3006•1d ago•28 comments

Show HN: My Open Source Deep Research tools beats Google and I can Prove it

https://github.com/IamLumae/Project-Lutum-Veritas
17•LutumVeritas•1d ago•3 comments

Show HN: Sandbox Agent SDK – unified API for automating coding agents

https://github.com/rivet-dev/sandbox-agent
40•NathanFlurry•5d ago•7 comments

Show HN: I trained a 9M speech model to fix my Mandarin tones

https://simedw.com/2026/01/31/ear-pronunication-via-ctc/
466•simedw•3d ago•151 comments

Show HN: Cloud-cost-CLI – Find cloud $$ waste in AWS, Azure and GCP

https://github.com/vuhp/cloud-cost-cli
4•vuhp•12h ago•0 comments

Show HN: HoundDog.ai – Ultra-Fast Code Scanner for Data Privacy

https://github.com/hounddogai/hounddog
15•joohwan•12h ago•6 comments

Show HN: Sklad – Secure, offline-first snippet manager (Rust, Tauri v2)

https://github.com/Rench321/sklad
20•rench321•19h ago•7 comments

Show HN: File Markers – Track file status directly in VS Code's Explorer

https://github.com/joneldominic/vscode-file-markers
2•joneldominic•13h ago•1 comments

Show HN: Phage Explorer

https://phage-explorer.org/
123•eigenvalue•2d ago•34 comments

Show HN: A different approach to intonation training

https://intunetrainer.conpixel.es/
5•ogig•13h ago•1 comments

Show HN: Amla Sandbox – WASM bash shell sandbox for AI agents

https://github.com/amlalabs/amla-sandbox
146•souvik1997•3d ago•73 comments

Show HN: Zuckerman – minimalist personal AI agent that self-edits its own code

https://github.com/zuckermanai/zuckerman
71•ddaniel10•1d ago•50 comments

Show HN: Nucleus – enforced permission envelopes for AI agents (Firecracker)

https://github.com/coproduct-opensource/nucleus
3•difc•15h ago•3 comments

Show HN: Make AI motion videos with text

https://framecall.com/
5•mesmertech•15h ago•2 comments

Show HN: Bullmq-dash – Terminal UI dashboard for BullMQ (zero setup)

https://www.npmjs.com/package/bullmq-dash
3•quanghuynt14•15h ago•0 comments

Show HN: Kolibri, a DIY music club in Sweden

https://kolibrinkpg.com/
143•EastLondonCoder•4d ago•31 comments