I let my AI agents run unsupervised and they burned $200 in 2 hours

https://blog.justcopy.ai/p/i-let-my-ai-agents-run-unsupervised

21•anupsingh123•3mo ago

Comments

anupsingh123•3mo ago

Classic "I'll be right back" moment that cost me real money.

Building justcopy.ai - lets you clone, customize and ship any website. Built 7 AI agents to handle the dev workflow automatically.

Kicked them off to test something. Went to grab coffee.

Came back to a $100 spike on my OpenRouter bill. First thought: "holy shit we have users!"

We did not have users.

Added logging. The agent was still running. Making calls. Spending money. Just... going. Completely autonomous in the worst possible way. Final damage: $200.

The fix was embarrassingly simple: - Check for interrupts before every API call - Add hard budget limits per session - Set timeouts on literally everything - Log everything so you're not flying blind

Basically: autonomous ≠ unsupervised. These things will happily burn your money until you tell them to stop.

Has this happened to anyone else? What safety mechanisms are you using?

fragmede•3mo ago

Privacy.com credit card with a limit set, and making sure that billing is not set to auto on the LLM platform.

anupsingh123•3mo ago

How would that help with supervising agent runs for each user on justcopy.ai?

fragmede•3mo ago

Anthropic won't run your API calls if you're out of API credits (and on that plan) so if there's only $10 in the account, you run $10 worth of API calls, and then the calls fail instead of costing you money.

W3schoolz•3mo ago

What a great learning opportunity! Supervision is key and budget limits are highly valuable in preventing surprises.

That said, I think a budget limit of $5-10k per agent makes sense IMO. You're underpaying your agents and won't get principal engineer quality at those rates.

magicalhippo•3mo ago

I thought the hotel AI's playing poker together in Altered Carbon was a bit cheesy until these newfangled LLM-driven agents came along, and it all seemed a lot more realistic.

Agents doing nothing, just doing things for the sake of doing things.

Seems we're there.

vorpalhex•3mo ago

"Good job claude, go ahead and fire up some poker with your friends for a few hours. You've earned some downtime."

I am now going to make a multi-agent poker MCP as a joke. Thank you.

SpaceNoodled•3mo ago

My chief safety mechanism is not using money-burning slop generators.

anupsingh123•3mo ago

That's one approach. For me, the agent setup cut what used to be a full day of manual work down to minutes - even with the $200 learning tax, that's still a net win. But I get the skepticism.

leptons•3mo ago

Oh, they burned a lot more than $200, you just paid only $200. These things are costing way more than what people pay for them, the price heavily subsidized.

simonw•3mo ago

I think the opposite is much more likely to be true: that vendors who charge money for inference are charging more than it costs them to service a prompt.

I've heard from sources that I trust that both AWS and Google Gemini charge more than it costs them in energy to run inference.

You can get a good estimate for the truth here by considering open weight models. It's possible to determine exactly how much energy it costs to serve DeepSeek V3.2 Exp, since that model is open weight. So run that calculation, then take a look at how much providers are charging to serve it and see if they are likely operating at a loss.

Here are some prices for that particular model: https://openrouter.ai/deepseek/deepseek-v3.2-exp/providers

Tade0•3mo ago

If that's the case, then why are AI companies bleeding money?

Or: what are they bleeding money on?

anupsingh123•3mo ago

btw this was DeepSeek-V3.2. If I'd been using Claude Sonnet 4.5, we'd be looking at a $2000 bill instead.

Tade0•3mo ago

Okay, yikes. Good thing that you even can set up those controls, unlike with that other company in the compute infrastructure business.

barrkel•3mo ago

Research runs mostly.

https://epoch.ai/data-insights/openai-compute-spend

simonw•3mo ago

They lose money on research and training and offering model trials for free (a marketing expenses).

That doesn't mean that when they do charge for the models - especially via their APIs - that they are serving them at a unit cost loss.

surgical_fire•3mo ago

Depends on the vendor and how they charge. OpenAI loses money on subscriptions [1]. Maybe the people who pay 200 bucks on a subscription are exactly the kind of people that will try to use the maximum out of it, and if you go down to the 20 bucks tier you will find more of the type of user that pays but doesn't use it all that much?

I would presume that companies selling compute for AI inference either make some money or at least break even when they serve a request. But I wouldn't b surprised if they are subsidizing this cost for the time being.

[1]: https://finance.yahoo.com/news/sam-altman-says-losing-money-...

simonw•3mo ago

That "losing money on subscriptions" story is a one-off Sam Altman tweet from January 2025, when they were promoting their brand new $200 account and the first version of Sora. I wouldn't treat that as a universal truth.

https://twitter.com/sama/status/1876104315296968813

"insane thing: we are currently losing money on openai pro subscriptions!

people use it much more than we expected"

surgical_fire•3mo ago

Sam Altman is a bullshitter. A liar cares about the truth and attempts to hide it. A bullshitter doesn't care if something is true of false, and is just using rhetoric to convince you of something.

I don't doubt that it is true that they lose money on a 200 subscription because the people that pay 200 are probably the same people that will max out usage over time, no matter how wasteful. Sam Altman was framing it in a way to say "it's so useful people are using it more than we expected!", because he is interested in having everyone believe that LLMs are the future. It's all bullshit.

If I had to guess, they probably at least break even on API calls, and might make some money on lower tier subscriptions (i.e.: people that pay for it but use it sparingly on a as-need basis).

But that is boring, and hints at limited usability. Investors won't want to burn hundreds of billions in cash for something that may be sort of useful. They want destructive amounts of money in return.

Tade0•3mo ago

Ok, fine, but I think it's disindigenous to only mention energy expenditure. There's also infrastructure, necessary re-training and R&D - of which we don't know how much must be spent just to stay in the market.

simonw•3mo ago

Competitive, venture backed companies losing money when you take R&D into account in a high growth market is how the tech industry has worked for decades.

Shopify, Uber and Airbnb all hit profitability after 14 years. Amazon took 9.

Tade0•3mo ago

The mentioned didn't require the sort of R&D AI does.

And this isn't something that will go away anytime soon. OpenAI for instance is projecting that in 2030 R&D will still account for 45% of their costs. They think they'll be profitable by that time, or so they're telling investors.

leptons•3mo ago

And none of those companies lost anywhere near as much money as "AI" is currently, and will continue to do. Just because they become profitable 5 or 10 or 15 years from now does not mean that they will be able to pay off the hundreds of billions to trillions spent getting them there anytime soon. And for what? AI slop ruining every fucking thing while heating the planet ever faster? Sounds like a great future we have ahead with "AI".

Ferret7446•3mo ago

On building the next new feature/integration/whatever? I feel like this should be a rhetorical question, but the fact that it was asked I also feel it is not so...

beAbU•3mo ago

You cant conveniently ignore the cost of model development and training.

This is like saying solar power is free if you ignore the equipment and installation costs.

Even worse still, model creators are in an arms race. They can't release a model and call it a day, waiting for it to start paying for itself. They need to immediately jump on to the next version of the model or risk falling behind.

automatic6131•3mo ago

The kind of person who wants to build a website copier is exactly who I had in mind for the target of vibecoding.

Bad idea, bad execution, I like it when a plan comes together.

anupsingh123•3mo ago

I think there's some confusion about what justcopy does - it's for cloning YOUR OWN projects, not scraping other people's websites. Built it out of frustration when I tried to fork one of my projects for a different idea and it took a full day even with Claude Code and Cursor. Lots of manual config updates, dependency changes, renaming stuff, etc. The $200 mistake was about agent orchestration, not the ethics of the product. But appreciate the feedback - clearly need to communicate the use case better.

automatic6131•3mo ago

I'm not going to pay you to slightly rip off my own ideas. Who is going to pay you for this, and what are they doing with it?

chucksta•3mo ago

>For those who don’t know, we’re building a tool that lets you copy any website, customize it, and deploy it - all automated.

_any_ website, can't imagine why there is _any_ confusion.

dwaltrip•3mo ago

God, I hate marketing lies.

I don’t care if you make less money, don’t fucking lie.

brazukadev•3mo ago

Are you expecting people to believe that?

This reminds of that law that people can only legally play their own games using console emulators.

dominicrose•3mo ago

Even without AI, companies have been burning cash uncontrollably on cloud services. I guess it's worth it when time saved, scalability etc, is much much more valuable than money.

pjdkoch•3mo ago

If you buy senior engineering hours and give them vague requirements, this is close enough to what you'll get.

cafebabbe•3mo ago

Ah so this is where the current GDP growth comes from.

jb4020•3mo ago

This is phishing/scam heaven. I already warned some european friends in healthcare about this and hope someone considers legal steps against such unethical and dangerous practices.

Al Lowe on model trains, funny deaths and working with Disney

Hoot: Scheme on WebAssembly

First Proof

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

Stories from 25 Years of Software Development

Reinforcement Learning from Human Feedback

The Waymo World Model

Start all of your commands with a comma (2009)

France's homegrown open source online office suite

Vocal Guide – belt sing without killing yourself

The AI boom is causing shortages everywhere else

Software factories and the agentic moment

Coding agents have replaced every framework I used

A Fresh Look at IBM 3270 Information Display System

What Is Stoicism?

72M Points of Interest

Unseen Footage of Atari Battlezone Arcade Cabinet Production

Where did all the starships go?

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

Learning from context is harder than we thought

Monty: A minimal, secure Python interpreter written in Rust for use by AI

British drivers over 70 to face eye tests every three years

Making geo joins faster with H3 indexes

Hackers (1995) Animated Experience

Sheldon Brown's Bicycle Technical Info

Ga68, a GNU Algol 68 Compiler

Show HN: I spent 4 years building a UI design tool with only the features I use

An Update on Heroku

Show HN: If you lose your memory, how to regain access to your computer?

What Is Ruliology?

Al Lowe on model trains, funny deaths and working with Disney

Hoot: Scheme on WebAssembly

First Proof

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

Stories from 25 Years of Software Development

Reinforcement Learning from Human Feedback

The Waymo World Model

Start all of your commands with a comma (2009)

France's homegrown open source online office suite

Vocal Guide – belt sing without killing yourself

The AI boom is causing shortages everywhere else

Software factories and the agentic moment

Coding agents have replaced every framework I used

A Fresh Look at IBM 3270 Information Display System

What Is Stoicism?

72M Points of Interest

Unseen Footage of Atari Battlezone Arcade Cabinet Production

Where did all the starships go?

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

Learning from context is harder than we thought

Monty: A minimal, secure Python interpreter written in Rust for use by AI

British drivers over 70 to face eye tests every three years

Making geo joins faster with H3 indexes

Hackers (1995) Animated Experience

Sheldon Brown's Bicycle Technical Info

Ga68, a GNU Algol 68 Compiler

Show HN: I spent 4 years building a UI design tool with only the features I use

An Update on Heroku

Show HN: If you lose your memory, how to regain access to your computer?

What Is Ruliology?

I let my AI agents run unsupervised and they burned $200 in 2 hours

Comments