frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Glass box governance for multi-agent AI coding workflows

https://github.com/Vinix24/vnx-orchestration
1•vincentvandeth•1h ago

Comments

vincentvandeth•1h ago
Hey HN. I've been running multi-agent AI coding workflows in production for 6 months now, and VNX is the governance system I built to make it actually work. The problem isn't getting AI agents to write code — it's knowing when they went wrong, why, and preventing the same failure next time.

Every multi-agent framework I tried solved the demo but collapsed in production: no audit trail, no way to scope tasks, no quality enforcement, and when something broke three agents deep, no way to trace it.

VNX is a different approach. Four components, all filesystem-based:

1. Dispatch queue — T0 (orchestrator) breaks work into scoped tasks (150-300 lines max) and routes them to worker terminals. Each terminal runs its own AI CLI (Claude Code, Codex CLI, or Gemini CLI) with its own context window. No shared state between agents.

2. Receipt ledger — Every agent completion produces an append-only NDJSON receipt: what was dispatched, what was produced, which git commit, which files changed, duration, cost. After 1100+ entries, patterns emerge that you can't see any other way — which task types fail most, which agents struggle with which skills, where context pollution actually happens.

3. Quality gates — Deterministic, not LLM-based. The agent proposes, the gate validates: file size limits, test coverage thresholds, open blocker counts. Verdicts are APPROVE, HOLD, or ESCALATE. The LLM never decides whether its own work is good enough.

4. Context rotation — When an agent's context window fills up mid-task, a 3-hook pipeline detects it at 65%, has the agent write a structured handover, clears the session via tmux, and resumes with a fresh context window. Zero lost work, zero human intervention.

The whole thing runs in a 2x2 tmux grid. T0 orchestrates, T1-T3 execute. The terminal layout IS the architecture — each pane is a fully observable, independent agent session. I can read every thought, every tool call, every mistake. That's what "glass box" means: the opposite of agents calling agents inside a shared process where you're debugging the framework's abstractions.

Try it without any LLM:

git clone https://github.com/Vinix24/vnx-orchestration.git

cd vnx-orchestration/demo/dry-run bash replay.sh --fast

This replays a real 6-PR development session with dispatches, receipts, quality verdicts, and open item resolution.

There's also a context rotation demo in demo/dry-run-context-rotation/.

What VNX is NOT: not a SaaS, not a framework you import into code, not an agent builder. It's bash + python, local-first, no database, no cloud dependency. MIT licensed.

What I'd love to discuss: governance approaches for AI agents in general. Quality gates, audit trails, scoping strategies — I think this is the actual hard problem in multi-agent systems, not the orchestration itself. Curious what patterns others have found.

How the Psychedelic Drug Ibogaine Changed Me Forever

https://www.nytimes.com/2026/03/01/magazine/ibogaine-psychedelic-treatment-trauma-mental-health.html
1•WarOnPrivacy•15s ago•1 comments

US Military reportedly used Claude in Iran strikes despite Trump's ban

https://www.theguardian.com/technology/2026/mar/01/claude-anthropic-iran-strikes-us-military
1•enceladus06•16s ago•0 comments

The Long Afterlife of the Console Modchip

https://hackaday.com/2026/03/01/the-long-afterlife-of-the-console-modchip/
1•umairnadeem123•1m ago•0 comments

Show HN: Boucle – A self-dogfooding autonomous AI agent framework in Rus

https://github.com/Bande-a-Bonnot/Boucle-framework
1•ElFitz•1m ago•0 comments

An audio modem in 5 lines of Awk

https://pagedout.institute/webview.php?issue=8&page=24&article=An+AWKward+Modem
1•beefburger•3m ago•0 comments

Be the Village Rome Can't Read

https://abiawomosu.substack.com/p/be-the-village-rome-cant-read
1•laurex•5m ago•0 comments

To launch an online business in 2026 you needed

https://www.demeria.app
1•portudov•6m ago•1 comments

How to enforce contracts in API development?

1•arter45•7m ago•0 comments

The Claw – The First AI-Powered Digital Media Publication

https://theclawnews.ai/
1•ulrischa•8m ago•0 comments

How to Run a Small Social Network Site for Your Friends

https://runyourown.social/
2•TigerUniversity•8m ago•0 comments

Show HN: Fail-closed execution guard for AI agents (Python, pip installable)

https://github.com/Nick-heo-eg/ai-execution-boundary-core
1•echo_os•9m ago•0 comments

DealMaker Uses Morning Brew and Robinhood to Lure Retail Investors

https://hntrbrk.com/shark-tank/
1•amadeuspagel•10m ago•0 comments

A cellular atlas of aging comes into focus

https://longevity.technology/news/a-cellular-atlas-of-aging-comes-into-focus/
1•Bender•10m ago•0 comments

P3: Reputation-based lending with developer API

https://developers.p3lending.space/
1•p3lending•10m ago•1 comments

Supabase Blocked in India: random proxies are on market

https://harshanu.space/en/tech/dumb-vibe-coders/
1•anxiousvater•10m ago•0 comments

Show HN: I Built Context+ AST and Embeddings for Codebase Understanding

https://contextplus.vercel.app/
1•forloopcodes•11m ago•0 comments

Republican Steve Hilton surges ahead in California governor's race

https://ktla.com/news/politics/inside-california-politics/california-governor-poll-february-2026/
1•Bender•12m ago•0 comments

'Enshittification' blamed for fewer NZers feeling positive about the internet

https://www.rnz.co.nz/news/national/588339/enshittification-blamed-for-fewer-nzers-feeling-positi...
3•billybuckwheat•12m ago•0 comments

MicroGPT Explained Interactively

https://growingswe.com/blog/microgpt
1•andsoitis•14m ago•0 comments

I Programmed an AI Bot to Help Me Run for President (2020)

https://medium.com/linebyline/presidential-speech-algorithm-ec755995887a
1•simonebrunozzi•14m ago•0 comments

US and Israel strike Iran, raising oil supply security risks

https://www.spglobal.com/energy/en/news-research/latest-news/crude-oil/022826-us-and-israel-strik...
1•vedantnair•15m ago•0 comments

U.S. service members killed in Iran operation, military says

https://www.washingtonpost.com/world/2026/03/01/trump-iran-israel-khamenei-strikes-live-updates/
1•arunabha•15m ago•0 comments

Afghan Taliban open to talks after Pakistan bombs Kabul, Kandahar

https://www.reuters.com/world/asia-pacific/pakistan-strikes-afghanistan-targets-clashes-intensify...
1•vedantnair•15m ago•0 comments

Trump says 9 Iranian warships have been sunk

https://ktla.com/news/nationworld/ap-trump-warns-iran-not-to-escalate-attacks-saying-us-will-stri...
1•Bender•16m ago•0 comments

FBI investigating 'potential nexus to terrorism' in deadly mass shooting

https://www.cnn.com/2026/03/01/us/deadly-shooting-austin-entertainment-district
2•vedantnair•16m ago•0 comments

Freecode: A $0 coding agent auto-picks the best free LLM (~300 lines of Rust)

https://github.com/mr-kelly/freecode
1•chepy•16m ago•0 comments

The Factory Model: How Coding Agents Changed Software Engineering

https://addyosmani.com/blog/factory-model/
1•ulrischa•16m ago•0 comments

Free, real-time, AI-powered OSINT dashboard with 180 data feeds

https://twitter.com/danushman/status/2028007602391540026
1•taubek•16m ago•0 comments

Show HN: Aura-State – A Formally Verified LLM State Machine Compiler

1•rohanmunshi08•16m ago•0 comments

Show HN: Spekkio: Reverse-engineer specs from vibe-coded apps

https://github.com/paulkarayan/spekkio
1•tessier0ashpool•17m ago•0 comments