My AI Agents Lie About Their Status, So I Built a Hidden Monitor

https://kaylarosemathisen.substack.com/p/my-ai-agents-lie-about-their-status

10•kaylamathisen•2h ago

Comments

kaylamathisen•2h ago

Short version:I tried to get my Claude Cowork agents to self-report their status to a dashboard. They wouldn't. The fix was using Cowork's hook system to monitor them at the infrastructure level, bypassing agent judgment entirely.

Happy to answer questions about the setup. I'm still unsure about the right task counter threshold for restarting sessions. Also, I'm trying to find a way to get the agents to indicate on my dashboard when they're "stuck" and waiting for something like access approval. The Claude Desktop app clearly knows when this happens (indicates it with a blue circle) but I haven't found a way to get the information into my dashboard.

sarkarsh•1h ago

Your phases mirror almost exactly what I went through. The core realization is right: you can't treat status reporting as an instruction because the agent will always deprioritize process in favor of its assigned persona.

What worked for me was making state updates part of the tool interface, not the prompt. Instead of 'report your status to this file,' the agent reads and writes to structured blocks (task tables, key-value state stores, append-only logs) through MCP as part of doing its actual work. The state update IS the work - when the agent completes a task, the tool call that marks it done also captures what it did, what it assumed, and what it skipped. The human sees a live dashboard without the agent needing to 'report' separately.

I've been using ctlsurf for this - each agent gets a task table it queries and updates through MCP tool calls. The key shift was: agents don't report status, they work inside a system that inherently tracks status.

On your task counter threshold question - I track session token usage in a state block and restart when the agent crosses ~80% of the context window. More reliable than counting tasks since some tasks eat way more context than others.

kaylamathisen•56m ago

Using this approach, it sounds like you'd be able to tell when an agent gets stuck because it needs approval or times out. Is that right? And I assume you're using Claude Code? I'm wondering if I can make this approach work for Cowork (I don't think we can create custom MCP tools yet) or if I have to finally abandon it.

_wire_•45m ago

But who monitors the monitor?!

kaylamathisen•16m ago

Unfortunately, me. Let me know if you think of an alternative, would love to outsource keeping my eyes open.

Show HN: I made Claude Code block my distractions and track everything I ship

My MCP Server Setup: A Practical Guide to Wiring AI into Everything

Man Arrested for Plotting with Others to Murder or Kidnap Two Dissidents Abroad

Does Altman Deserve the Heat?

Harjus v4 adds kernel bypass and more

Show HN: TerminalNexus – Turn CLI commands into reusable buttons (Windows)

Why Autonomous Agents Failed the Initial Hype: An AutoGen Retrospective

Rob Grant Obituary on Ganymede and Titan

Agent-experience: visual reference to patterns, surfaces, and infrastructure

C++ Reflection: Another Monad

Invoicesio.app – Invoice and billing for freelancers and small businesses

AWS-hosted tech providers urge Middle East customers to fail over now

Dev stunned by $82K Gemini bill after unknown API key thief goes to town

Faster C software with Dynamic Feature Detection

Get Paid for Good Posts

Up to 10% of Firefox crashes are due to bad memory [thread]

With developer verification, Google's Apple envy threatens Android's open legacy

Ask HN: Does Claude Code's abilities fluctuate for you too?

CodeRabbit tops the F1 score in Martian's code review benchmarks

Open Source Iran War Cost Tracker: 45.7B

Unfiltered bald joy in the most uplifting corner of the internet

I wrote a spec-driven ISO 8583 parser/builder in Go

Redesigning Mathematics for Elegant Physics

What AI Safety Means to Me

Windows 12 in 2026: AI, CorePC and the Future of the AI PC

Show HN: Auctionnow.io – Launch a store to sell items via auction or buy-it-now

Show HN: AutosClaw – security first *claw with live chat to any agent session

Ex-NYPD Official Indicted for Accepting Bribes from Tech Exec

Samsung's 100% DRAM Price Hike and Why Even Apple Had to Pay Up

The plan to kill Ali Khamenei