
Ask HN: What breaks when you run AI agents unsupervised?

11•marvin_nora•1d ago
I spent two weeks running AI agents autonomously (trading, writing, managing projects) and documented the 5 failure modes that actually bit me:

1. Auto-rotation: Unsupervised cron job destroyed $24.88 in 2 days. No P&L guards, no human review.

2. Documentation trap: Agent produced 500KB of docs instead of executing. Writing about doing > doing.

3. Market efficiency: Scanned 1,000 markets looking for edge. Found zero. The market already knew everything I knew.

4. Static number fallacy: Copied a funding rate to memory, treated it as constant for days. Reality moved; my number didn't.

5. Implementation gap: Found bugs, wrote recommendations, never shipped fixes. Each session re-discovered the same bugs.
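Failure mode 1 is the most directly preventable of the five: the post says the cron job ran with no P&L guards at all. As a minimal sketch of what such a guard could look like, here is a hard drawdown limit that halts the loop and defers to a human (`place_order`, the trade results, and the $10 limit are all hypothetical stand-ins, not a real exchange API):

```python
# Minimal P&L guard sketch: halt an agent's trading loop once
# cumulative losses cross a hard dollar limit, instead of letting an
# unsupervised cron job keep bleeding money.

MAX_DRAWDOWN = 10.0  # hypothetical hard limit before human review

class DrawdownGuard:
    def __init__(self, max_drawdown):
        self.max_drawdown = max_drawdown
        self.pnl = 0.0
        self.halted = False

    def record(self, trade_pnl):
        self.pnl += trade_pnl
        if self.pnl <= -self.max_drawdown:
            self.halted = True  # stop trading, escalate to a human

    def allow(self):
        return not self.halted

guard = DrawdownGuard(MAX_DRAWDOWN)
for trade_pnl in [-4.0, -5.0, 2.0, -6.0, -3.0]:  # simulated trade results
    if not guard.allow():
        break  # a human reviews before any further orders go out
    guard.record(trade_pnl)

print(guard.halted, guard.pnl)
```

The point is not the threshold value but that the check lives outside the agent, so a confused model cannot talk itself past it.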

Built an open-source funding rate scanner as fallout: https://github.com/marvin-playground/hl-funding-scanner

Full writeup: https://nora.institute/blog/ai-agents-unsupervised-failures.html

Curious what failure modes others have hit running agents without supervision.

Comments

Damjanmb•1d ago
I have seen agents fail mostly at state management and guardrails. Without strict role separation and hard limits, they drift. Multi-tenant isolation and cost caps are not optional. Autonomy without boundaries becomes expensive noise.
CodeBit26•23h ago
The biggest break usually happens in the 'loop-back' logic. When an agent receives ambiguous output and starts hallucinating its own confirmation, it can consume API credits exponentially without achieving the goal. We really need better 'circuit breaker' patterns for autonomous agents to prevent these feedback loops.
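A circuit breaker for this can be as crude as a hard cap on steps and estimated spend that trips before a confirmation loop runs away. A sketch under assumed per-call costs (`run_agent` and the cost figure are hypothetical):

```python
# Circuit-breaker sketch for an autonomous agent loop: trip when the
# loop exceeds a step budget or an estimated spend budget, rather than
# letting an ambiguous-output feedback loop burn API credits forever.

class CircuitBreaker:
    def __init__(self, max_steps, max_cost):
        self.max_steps = max_steps
        self.max_cost = max_cost
        self.steps = 0
        self.cost = 0.0

    def charge(self, call_cost):
        self.steps += 1
        self.cost += call_cost
        if self.steps > self.max_steps or self.cost > self.max_cost:
            raise RuntimeError("circuit breaker tripped")

def run_agent(breaker):
    # Hypothetical worst case: the agent never reaches its goal, so
    # only the breaker ends the run.
    while True:
        breaker.charge(call_cost=0.02)  # assumed cost per model call

breaker = CircuitBreaker(max_steps=50, max_cost=5.00)
try:
    run_agent(breaker)
except RuntimeError as exc:
    print(exc, breaker.steps)
```

Tripping raises out of the loop entirely, so the agent cannot "handle" its own shutdown and keep going.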
vincentvandeth•12h ago
Great list. I've been running a multi-agent orchestration system (11 specialized AI agents) in production for 6 months and your #2 and #5 resonate hard.

What I'd add:

6. Confidence without evidence. Agents will report "task complete" with high confidence when the output is plausible but wrong. Without automated validation gates, you won't catch it until production breaks.

7. Context drift in long sessions. After 50+ tool calls, agents start losing track of earlier decisions. They'll contradict their own architecture choices from 20 minutes ago. Session length is an underrated failure vector.

8. The "almost right" problem. Agents rarely fail catastrophically; they fail subtly. Code that passes tests but misses edge cases. Docs that look complete but have wrong cross-references. This is worse than obvious failures because you trust the output.
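For #6, the gate has to check the artifact, never the agent's self-report. A minimal sketch, assuming the agent is supposed to emit JSON with a few known required fields (the schema here is a hypothetical example, not from the post):

```python
import json

# Validation-gate sketch: an agent says "task complete", but the
# output only moves forward if it actually parses and carries the
# required fields. Plausible-but-wrong output fails the gate.

REQUIRED_FIELDS = {"market", "funding_rate", "timestamp"}  # hypothetical schema

def gate(agent_output: str) -> bool:
    """Return True only if the output passes automated checks."""
    try:
        data = json.loads(agent_output)
    except json.JSONDecodeError:
        return False
    return REQUIRED_FIELDS <= data.keys()  # all required fields present

plausible_but_wrong = '{"market": "BTC", "rate": 0.01}'  # looks fine, wrong field
actually_complete = '{"market": "BTC", "funding_rate": 0.01, "timestamp": 1700000000}'

print(gate(plausible_but_wrong), gate(actually_complete))
```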

What fixed most of these for me:

- Quality gates between agents: no agent's output moves forward without automated checks (tests, schema validation, consistency checks).

- Evidence-based confidence scores: not "how sure are you?" but "what specific evidence supports this output?"

- Human-in-the-loop at decision points, not everywhere. You can't review everything, so you design the system to surface the right moments for human judgment.

- Small scoped tasks: agents working on 150-300 line PRs with clear acceptance criteria fail way less than agents given open-ended goals.
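The first of those fixes composes naturally into a pipeline: each agent's output passes through its gate before the next agent ever sees it. A sketch with hypothetical stand-in agents and gates (real ones would be model calls and real checks):

```python
# Sketch of "quality gates between agents": each stage's output must
# pass an automated check before the next stage runs, so a bad
# intermediate artifact stops the pipeline instead of propagating.

def pipeline(stages, task):
    """stages: list of (name, agent, gate) triples. Raises at the first gate failure."""
    result = task
    for name, agent, gate in stages:
        result = agent(result)
        if not gate(result):
            raise ValueError(f"gate failed after {name}")
    return result

# Hypothetical two-stage flow: draft code, then write docs for it.
stages = [
    ("coder", lambda t: t + " [code]", lambda out: "[code]" in out),
    ("doc_writer", lambda t: t + " [docs]", lambda out: "[docs]" in out),
]

print(pipeline(stages, "fix funding-rate parser"))
```

A failed gate raising an exception (rather than returning a flag the next agent might ignore) is what keeps the boundary hard.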

Your #5 (implementation gap) is the one I see most people underestimate. The fix isn't better agents, it's better systems around the agents.

Happy to share more details about the architecture if anyone's interested.

LetsAutomate•3h ago
Tool/API failures
