frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Real-time dashboard for Claude Code agent teams

https://github.com/simple10/agents-observe
34•simple10•1h ago
This project (Agents Observe) started as an exploration into building automation harnesses around claude code. I needed a way to see exactly what teams of agents were doing in realtime and to filter and search their output.

A few interesting learnings from building and using this:

- Claude code hooks are blocking - performance degrades rapidly if you have a lot of plugins that use hooks

- Hooks provide a lot more useful info than OTEL data

- Claude's jsonl files provide the full picture

- Lifecycle management of MCP processes started by plugins is a bit kludgy at best

The biggest takeaway is how much of a difference it made in claude performance when I switched to background (fire and forget) hooks and removed all other plugins. It's easy to forget how many claude plugins I've installed and how they effect performance.

The Agents Observe plugin uses docker to start the API and dashboard service. This is a pattern I'd love to see used more often for security (think Axios hack) reasons. The tricky bit was handling process management across multiple claude instances - the solution was to have the server track active connections then auto shut itself down when not in use. Then the plugin spins it back up when a new session is started.

This tool has been incredibly useful for my own daily workflow. Enjoy!

Comments

theagentwall•1h ago
great idea. I am curious what the future of coding with multiple terminals and agents will look like and this looks like a great start!
simple10•1h ago
Thanks! This was step one in my daily driver stack - better observability. I also bundled up a bunch of other observability services in https://github.com/simple10/agent-super-spy so I can see the raw prompts and headers.

The next big layer for my personal stack is full orchestration. Something like Paperclip but much more specialized for my use cases.

ivaivanova•36m ago
Good to know background hooks make that much of a difference. How are you handling the case where multiple agent teams are writing to the same jsonl files simultaneously?
simple10•27m ago
I'm not actually reading the jsonl files. Agents Observe just uses hooks and sends all hook data the server (running as a docker container by default).

Basic flow:

1. Plugin registers hooks that call a dump pipe script that sends hook events data to api server

2. Server parses events and stores them in sqlite by session and agent id - mostly just stores data, minimal processing

3. Dashboard UI uses websockets to get real-time events from the server

4. UI does most of the heavy lifting by parsing events, grouping by agent / sub-agent, extracting out tool calls to dynamically create filters, etc.

It took a lot of iterations to keep things simple and performant.

You can easily modify the app/client UI code to fully customize the dashboard. The API app/server is intentionally unopinionated about how events will be rendered. This was by design to add support for other agent events soon.

ivaivanova•21m ago
The hooks approach seems much cleaner for real-time. Did you run into any issues with the blocking hooks degrading performance before you switched to background?
simple10•7m ago
Sort of. It wasn't really noticeable until I did an intentional audit of performance, then noticed the speed improvements.

Node has a 30-50ms cold start overhead. Then there's overhead in the hook script to read local config files, make http request to server, and check for callbacks. In practice, this was about 50-60ms per hook.

The background hook shim reduces latency to around 3-5ms (10x improvement). It was noticeable when using agent teams with 5+ sub-agents running in parallel.

But the real speed up was disabling all the other plugins I had been collecting. It piles up fast and is easy for me to forget what's installed globally.

I've also started periodically asking claude to analyze it's prompts to look for conflicts. It's shockingly common for plugins and skills to end up with contradictory instructions. Opus works around it just fine, but it's unnecessary overhead for every turn.

petcat•25m ago
Are you guys spending hundreds (or thousands) of dollars a day on Claude tokens? Holy crap. I can't get more than one or two agents to do anything useful for very long before I'm hitting my usage limits.
simple10•20m ago
I hit a lot of limits on Pro plan. Upgraded to Max $200/mo plan and haven't hit limits for awhile.

It's super important to check your plugins or use a proxy to inspect raw prompts. If you have a lot of skills and plugins installed, you'll burn through tokens 5-10x faster than normal.

Also have claude use sub-agents and agent teams. They're significantly lighter on token usage when they're spawned with fresh context windows. You can see in Agents Observe dashboard exactly what prompt and response claude is using for spawning sub-agents.

edwhitesell•8m ago
I'd bet there are many. I know a few teams with spends in the thousands of dollars per day. It sounds crazy, but not too unrealistic.
saadn92•11m ago
The hooks performance finding matches what I've seen. I run multiple Claude Code agents in parallel on a remote VM and the first thing I learned was that anything blocking in the agent's critical path kills throughput. Even a few hundred milliseconds per hook call compounds fast when you have agents making dozens of tool calls per minute.

The docker-based service pattern is smart too. I went a different direction for my own setup -- tmux sessions with worktree isolation per agent, which keeps things lightweight but means I have zero observability into what each agent is actually doing beyond tailing logs manually. This solves that gap in a way that doesn't add overhead to the agent itself, which is the right tradeoff.

Curious about one thing -- how does the dashboard handle the case where a sub-agent spawns its own sub-agents? Does it track the full tree or just one level deep?

Unsubscribe from the Church of Graphs

https://www.adorableandharmless.com/p/unsubscribe-from-the-church-of-graphs
3•devonnull•51s ago•0 comments

Ukrainian Drone Holds Position for 6 Weeks

https://defenceleaders.com/news/ukrainian-combat-robot-holds-frontline-position-for-six-weeks-in-...
1•AftHurrahWinch•1m ago•1 comments

Vibometer – The perfect vibe coder companion

https://github.com/GTP95/Vibometer
1•GTP•1m ago•0 comments

The AI Marketing BS Index

https://bastian.rieck.me/blog/2026/bs/
1•speckx•3m ago•0 comments

Recruiting (From Cavemen to LinkedIn Spam)

https://www.classet.ai/blog/history-of-recruiting
1•coopernewby•4m ago•0 comments

California's "Trusted AI" Order Is a Blueprint for a Censorship Weapon

https://twitter.com/amuse/status/2039367467022106689
1•11thEarlOfMar•5m ago•0 comments

Any good 2026 April Fools Pranks

1•ninju•5m ago•2 comments

Show HN: MinimumReleaseAge-style protection for Agent Skills

https://github.com/skill-mill/agent-skill-porter
1•hatappo•6m ago•1 comments

Designing for the Aliens

https://rajavijayaraman.com/writing/designing-for-alien/
1•rajavijayaraman•6m ago•0 comments

I built a Middle East conflict tracker that incorporates ground truth news

https://middleeastsignal.com/
1•amgadellaboudy•7m ago•1 comments

Show HN: Docs over SSH for Agents

https://github.com/supabase-community/supabase-ssh
1•gregnr•7m ago•0 comments

Federated and Independent [Plugin] Repositories in WordPress (Linux Foundation)

https://github.com/fairpm/fair-plugin
1•password4321•8m ago•0 comments

SpaceX Files to Go Public

https://www.nytimes.com/2026/04/01/technology/spacex-ipo-elon-musk.html
1•nutjob2•9m ago•0 comments

Run KataGo on iOS with Metal GPU Acceleration

https://github.com/eliotfowler/katago-metal-ios-demo
2•m-hodges•10m ago•0 comments

The Subprime Technical Debt Crisis

https://blog.happyfellow.dev/the-subprime-technical-debt-crisis/
1•birdculture•14m ago•0 comments

Rapid snow melt-off in American west stuns scientists

https://www.theguardian.com/us-news/2026/apr/01/snowmelt-american-west
2•dboreham•14m ago•1 comments

"attention is all you need" in screen recording

https://old.reddit.com/r/macapps/comments/1s938er/not_only_autozoom_also_spotlight_and_lightbox_for/
1•jiabins0303•15m ago•1 comments

Fungi could transform leftovers into lifelines

https://phys.org/news/2026-02-fungi-leftovers-lifelines.html
2•PaulHoule•17m ago•0 comments

Mercury 2, a diffusion LLM, outperforms StepFun 3.5 Flash on OpenClaw tasks

https://pinchbench.com/?view=graphs&graph=radar&models=inception%2Fmercury-2%2Cstepfun%2Fstep-3.5...
2•arpittarang•19m ago•1 comments

Banning All Anthropic Employees

https://joeyh.name/blog/entry/banning_all_Anthropic_employees/
4•speckx•22m ago•1 comments

Apple turns 50: tech company started with hobbyist computers

https://www.tomshardware.com/laptops/macbooks/apple-turns-50-one-of-the-worlds-biggest-tech-compa...
2•heresie-dabord•23m ago•0 comments

AI companies charge you 60% more based on your language, BPE tokens

https://tokenstree.com/newsletter-article-5.html
20•vfalbor•24m ago•10 comments

Show HN: Flight-Viz – 10K flights on a 3D globe in 3.5MB of Rust+WASM

https://flight-viz.com
2•coolwulf•24m ago•1 comments

Manage Your Odoo Operation from Your Phone in 2026

https://www.odooclaw.ai/blog/manage-your-entire-odoo-operation-from-your-phone-in-2026
1•oktra_dev•26m ago•0 comments

Apple Issues Rare iOS 18 Security Update to Protect Against DarkSword Exploit

https://www.macrumors.com/2026/04/01/apple-ios-18-darksword-security-update/
2•layer8•28m ago•0 comments

ASAPQuery: Drop-in accelerator for SQL/PromQL queries

https://github.com/ProjectASAP/ASAPQuery
1•milind1997•30m ago•1 comments

Proof-Carrying Border Gateway Protocol

https://rot256.dev/post/bgp-pcd/
3•alpn•31m ago•0 comments

Show HN: YouTube transcript viewer with search, language switching, and export

https://aiyoutubetranscript.com
1•ethanparker1•31m ago•0 comments

The Autocrat's Dilemma

https://www.theatlantic.com/international/2026/04/donald-trump-xi-jinping-china-authoritarianism/...
4•breve•32m ago•1 comments

The 'Claude-gap' relationship: While one partner sleeps, another vibe codes

https://www.businessinsider.com/claude-gap-relationship-vibe-code-couples-2026-3
1•jbredeche•32m ago•0 comments