Show HN: Batty – Run a team of AI coding agents in tmux with test gating

https://github.com/battysh/batty
1•Zedmor•1h ago
Hi HN, I'm the author.

I use Claude Code and Codex daily. Running one agent on a task works great. Running three or four in parallel on the same repo? They step on each other's files, nobody checks if the code compiles, and you spend more time coordinating than coding.

Batty is the supervisor layer I built to fix this. You define a team in YAML — an architect that plans work, a manager that dispatches it, engineers that execute. Batty launches each role in its own tmux pane, isolates engineer work in git worktrees, routes messages between roles, and gates task completion on passing tests.
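A minimal sketch of the worktree isolation described above, not Batty's actual code. It assumes one branch and one worktree per engineer; the `agent/<name>` branch scheme, the `<repo>-worktrees` directory, and both function names are hypothetical. Only `git -C` and `git worktree add -b` are real git commands.

```python
import subprocess
from pathlib import Path


def worktree_add_argv(repo, engineer, base="main"):
    """Build the git command that gives one engineer an isolated checkout.

    The directory layout and branch naming are illustrative assumptions.
    """
    repo = Path(repo)
    path = repo.parent / f"{repo.name}-worktrees" / engineer
    branch = f"agent/{engineer}"
    # git worktree add -b <new-branch> <path> <commit-ish>
    return ["git", "-C", str(repo), "worktree", "add", "-b", branch, str(path), base]


def create_worktree(repo, engineer, base="main"):
    """Run the command; raises CalledProcessError if git fails."""
    subprocess.run(worktree_add_argv(repo, engineer, base), check=True)
```

Each engineer then edits files only under its own worktree path, so parallel agents never write to the same working directory.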

The interesting part is what it's not: it's not an agent framework, and it doesn't embed any model. It orchestrates existing agent CLIs (Claude Code, Codex, Aider) using tmux as the runtime and git worktrees for isolation. Config is YAML, the kanban board is Markdown (powered by a bundled kanban-md tool), inboxes are Maildir, logs are JSONL. You can `git diff` your entire team state.
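To illustrate why plain files make team state diffable, here is a rough sketch of a Maildir inbox plus a JSONL event log using only the Python standard library. The directory layout (`inboxes/<role>`, `events.jsonl`), the event fields, and the function names are assumptions for illustration, not Batty's real on-disk format.

```python
import json
import mailbox
from email.message import EmailMessage
from pathlib import Path


def log_event(log_path, event):
    """Append one JSON object per line (JSONL)."""
    with open(log_path, "a", encoding="utf-8") as fh:
        fh.write(json.dumps(event) + "\n")


def send_message(root, sender, recipient, body):
    """Deliver a message into the recipient's Maildir and log the delivery."""
    root = Path(root)
    # Maildir only creates its own directory, so make the parent first.
    (root / "inboxes").mkdir(parents=True, exist_ok=True)
    inbox = mailbox.Maildir(str(root / "inboxes" / recipient), create=True)
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content(body)
    key = inbox.add(msg)  # written as a single file under new/
    inbox.close()
    log_event(root / "events.jsonl",
              {"event": "message_sent", "from": sender, "to": recipient, "key": key})
    return key
```

Every message is one file and every log entry is one line, so ordinary tools such as `git diff` and `grep` work on the whole state.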

Built in Rust, published on crates.io (v0.1.0). The daemon is a synchronous 5-second poll loop — no async complexity. It watches pane output to detect idle/active/dead agents, reads Claude and Codex session files on disk to reduce false-positive idle detection, and uses a merge lock to serialize concurrent worktree merges.
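The real daemon is written in Rust; the Python below is only a sketch of the same idea under stated assumptions. It classifies a pane as idle when its captured output is unchanged for a few polls (the threshold of 3 is made up), and it models the merge lock as an exclusive lock file. `tmux capture-pane -p -t` is a real tmux command; `PaneTracker`, `merge_lock`, and `run` are hypothetical names, and the session-file check is left out.

```python
import os
import subprocess
import time
from contextlib import contextmanager

POLL_SECONDS = 5.0
IDLE_AFTER_POLLS = 3  # hypothetical threshold, not taken from Batty


def capture_pane(target):
    """Return the pane's visible text, or None if tmux reports an error."""
    result = subprocess.run(["tmux", "capture-pane", "-p", "-t", target],
                            capture_output=True, text=True)
    return result.stdout if result.returncode == 0 else None


class PaneTracker:
    """Classify a pane as active, idle, or dead from successive captures."""

    def __init__(self, idle_after=IDLE_AFTER_POLLS):
        self.idle_after = idle_after
        self.last_output = None
        self.unchanged = 0

    def observe(self, output):
        if output is None:
            return "dead"
        if output == self.last_output:
            self.unchanged += 1
        else:
            self.last_output = output
            self.unchanged = 0
        return "idle" if self.unchanged >= self.idle_after else "active"


@contextmanager
def merge_lock(path):
    """Hold an exclusive lock file; raises FileExistsError if already held."""
    fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY)
    try:
        yield
    finally:
        os.close(fd)
        os.remove(path)


def run(targets, max_polls=None, capture=capture_pane, sleep=time.sleep):
    """Synchronous poll loop: capture every pane, classify, sleep, repeat."""
    trackers = {t: PaneTracker() for t in targets}
    states = {}
    polls = 0
    while max_polls is None or polls < max_polls:
        for target, tracker in trackers.items():
            states[target] = tracker.observe(capture(target))
        sleep(POLL_SECONDS)
        polls += 1
    return states
```

Because the loop is single-threaded, the lock only has to guard against a second process merging at the same time, which is why a lock file is enough for this sketch.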

Some things I learned running multi-agent setups:

- 3-5 parallel engineers is the sweet spot. Beyond that, the codebase itself becomes the bottleneck for absorbing parallel changes.
- Task decomposition quality matters more than agent count. A good architect prompt outperforms throwing more engineers at bad tasks.
- Test gating eliminated most of the chaos. Without it, agents "complete" work that breaks everything downstream.
- You still need to supervise. It's not fire-and-forget; it's closer to managing a junior team. The leverage is supervising five workstreams instead of doing one.

I know there's prior art in this space — Tmux-IDE and vibe-kanban both approach multi-agent coordination differently. Batty is more opinionated about supervision: the test gating and communication constraints are first-class, not optional. Different tradeoffs for different workflows.

It's early (v0.1.0). The core loop is solid but the API is still settling. Eight built-in templates range from solo (1 agent) to large (19 agents with three management layers). The architecture diagram in the README shows the full supervision flow.

2-minute demo: https://youtube.com/watch?v=2wmBcUnq0vw

Docs: https://battysh.github.io/batty

Happy to go deep on the architecture or the worktree strategy. For those running multiple agents: what's the biggest operational pain point?

Author of "Careless People" banned from saying anything negative about Meta

https://www.thetimes.com/uk/technology-uk/article/sarah-wynn-williams-careless-people-meta-nrffdfpmf
1•macleginn•27s ago•0 comments

Yes, Therapy Sessions Are Being Used to Train AI

https://www.thebignewsletter.com/p/yes-therapy-sessions-are-being-used
1•jyunwai•1m ago•0 comments

Getting to Know the Know-It-Alls: On a new history of pedantry

https://hedgehogreview.com/issues/humanism-in-a-posthumanist-age/articles/getting-to-know-the-kno...
1•cainxinth•2m ago•0 comments

OpenAI executive shuffle includes new role for COO

https://techcrunch.com/2026/04/03/openai-executive-shuffle-new-roles-coo-brad-lightcap-fidji-simo...
1•SilverElfin•3m ago•0 comments

New radio app in BETA –> Auralo

https://testflight.apple.com/join/mEtdrzZ5
1•marc0janssen•4m ago•1 comment

Weather Channel RetroCast Now Technical Breakdown

https://twitter.com/leemartin/status/2040428548964405402
1•leemartin•5m ago•1 comment

Landdown: Simple Sandboxing for Shell Scripts

https://git.sr.ht/~marcc/landdown
2•fanf2•6m ago•0 comments

Show HN: MyPDFBoy – Free PDF redaction that deletes the data

https://mypdfboy.com/
2•stanguc•6m ago•0 comments

Who was the actual target of the axios supply chain attack?

1•JohannMac•9m ago•0 comments

'The frontline is like Terminator': fighting robots give Ukraine hope in war

https://www.theguardian.com/science/2026/mar/16/a-molten-mushy-state-scientists-may-have-found-a-...
1•brkumar•11m ago•0 comments

Ice Age dice show early Native Americans may have understood probability

https://arstechnica.com/science/2026/04/ice-age-dice-show-early-native-americans-may-have-underst...
2•Brajeshwar•11m ago•0 comments

Boeing 777-9 Maximum Brake Energy Certification Testing [video]

https://www.youtube.com/watch?v=f2YQpDkqkn4
4•xnx•16m ago•2 comments

Apple's Spotlight Search Results Come with Engagement Metrics. No One Knew

https://www.buchodi.com/apples-spotlight-search-results-come-with-engagement-metrics-no-one-knew/
4•lmbbuchodi•20m ago•1 comment

Show HN: A simple iOS app that helps you give yourself some time

https://apps.apple.com/tr/app/alnuo/id6761344069
3•sezginozgur•21m ago•0 comments

Lisette – Rust syntax, Go runtime

https://lisette.run/
1•darccio•25m ago•0 comments

An Abject Horror

https://blog.mempko.com/an-abject-horror/
2•mpweiher•27m ago•0 comments

Show HN: I made open source, zero power PCB hackathon badges

https://github.com/KaiPereira/Overglade-Badges
3•kaipereira•28m ago•0 comments

The Indie Internet Index – submit your favorite sites

https://iii.social
2•freshman_dev•30m ago•0 comments

Microsoft's new framework for building and orchestrating AI agents

https://github.com/microsoft/agent-framework
2•steveharing1•32m ago•0 comments

Autonomous Vulnerability Hunting with MCP

https://blog.zsec.uk/bullyingllms/
3•pedro84•35m ago•0 comments

Re_terminal: Start your digital resistance – Geyser

https://geyser.fund/project/reterminalonboardingtheresistance?hero=geyserpromotion
3•janandonly•36m ago•0 comments

Intelligence Cannot Be Trained?

2•hyperzzw•38m ago•1 comment

Show HN: Semsei — AI SEO for clicks, not impressions

https://www.semsei.io/en
2•andresdvelez•38m ago•2 comments

Wrote an Honest Comparison of Final Round AI vs. Pramp

https://portlumeai.com/blog/final-round-ai-vs-pramp-interview-prep-comparison
2•portlumeai•38m ago•0 comments

New Advances Bring the Era of Quantum Computers Closer

https://www.quantamagazine.org/new-advances-bring-the-era-of-quantum-computers-closer-than-ever-2...
3•Brajeshwar•39m ago•0 comments

Show HN: Tokencap – Token budget enforcement across your AI agents

https://github.com/pykul/tokencap
2•pykul•39m ago•0 comments

No One at Waffle House Remembers FEMA Official Who Says He Teleported In

https://www.nytimes.com/2026/04/03/us/fema-gregg-phillips-waffle-house-teleportation.html
4•planewave•40m ago•0 comments

Using model-agnostic agentic annotation tools on locally stored files

https://old.reddit.com/r/LLMDevs/comments/1sbusn8/new_pdfviewer_notes_panel_search_downloader_tool/
3•ieuanking•44m ago•0 comments

Slop is content without grounding

https://grady.io/slop-is-content-without-grounding/
3•gradys•44m ago•0 comments

Whoop Sues Bevel

https://xcancel.com/greyngyen/status/2040100105336799659
3•etothet•47m ago•0 comments