I spent $250 on my first day doing what felt like harmless testing.
Nothing production. No customers. Just me trying things like:
“Summarize this Slack thread”
“Give me a morning digest”
“Explain this error log”
“Pull action items from the last N messages”
A couple Telegram alerts
At first I blamed OpenClaw. The real issue was simpler: I had Claude set as the default for basically everything, and I accidentally created a workflow where every run got more expensive than the last.
Here’s what actually happened.
“Simple tasks” weren’t simple because the context kept growing

I started with “summarize the last 30–50 messages.” Then I kept adding “just one more thing”:
- include prior decisions
- keep continuity across runs
- include relevant earlier context
- make it more detailed
That makes results feel smarter, but it turns every request into a bigger prompt. The tricky part is it still feels like the same task, so you don’t notice the cost drift until the number is already big.
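The fix for that drift is a hard cap on what gets included, no matter how much history exists. A minimal sketch of the idea (function and parameter names here are mine, not anything OpenClaw ships):

```python
def build_summary_input(messages, max_messages=30, max_chars=4000):
    """Keep the prompt bounded regardless of how much history exists.

    Takes only the newest `max_messages`, then drops oldest-first
    until the combined text fits within `max_chars`.
    """
    window = messages[-max_messages:]
    while window and sum(len(m) for m in window) > max_chars:
        window.pop(0)  # drop the oldest message first
    return "\n".join(window)

# Even with 10,000 messages of history, the prompt stays capped.
history = [f"message {i}" for i in range(10_000)]
prompt = build_summary_input(history)
```

The point isn’t the exact numbers; it’s that the cap is enforced in code instead of remembered by a human.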
Tool output bloat snowballed

I let tool outputs flow straight into the next step:
- long logs
- giant diffs
- full API responses
- “for debugging” screenshots
Even if one run is tolerable, the next run inherits the baggage. This is how testing quietly becomes a token furnace: output becomes input becomes output again.
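Breaking the output-becomes-input loop mostly means trimming payloads before they reach the next step. A hedged sketch, assuming the common case that logs and diffs carry their signal at the top and bottom:

```python
def trim_tool_output(text, head_lines=20, tail_lines=20, max_chars=2000):
    """Keep only the start and end of a long tool payload.

    Logs usually put the signal at the top (command, first error)
    and bottom (final state), so the middle can be elided.
    """
    lines = text.splitlines()
    if len(lines) > head_lines + tail_lines:
        omitted = len(lines) - head_lines - tail_lines
        lines = (lines[:head_lines]
                 + [f"... [{omitted} lines omitted] ..."]
                 + lines[-tail_lines:])
    return "\n".join(lines)[:max_chars]

log = "\n".join(f"line {i}" for i in range(500))
trimmed = trim_tool_output(log)
```

A 500-line log becomes ~40 lines plus an elision marker, and the next step inherits kilobytes instead of the whole furnace.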
Scheduled jobs created an “idle → warm-up tax” loop

I had cron-ish jobs that ran, went idle, then ran again.
If your setup effectively re-establishes a big prompt footprint on each run, you keep paying the setup cost repeatedly. It’s not one catastrophic request. It’s lots of medium ones with repeated overhead.
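The fix that worked for me was forcing a fresh-session boundary: each scheduled run builds its prompt from scratch instead of carrying a conversation forward. A toy sketch (the `fetch_messages` and `call_model` hooks are hypothetical stand-ins, not a real API):

```python
def run_scheduled_job(fetch_messages, call_model, system_prompt):
    """Each scheduled run builds its prompt from scratch.

    No conversation object survives between runs, so there is no
    accumulated history to re-send (and re-pay for) every time.
    """
    messages = [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": fetch_messages()},  # fresh data only
    ]
    return call_model(messages), len(messages)

# Fake hooks, just to show the prompt never grows across runs.
sizes = []
for _ in range(3):
    _, n = run_scheduled_job(
        fetch_messages=lambda: "latest 30 messages",
        call_model=lambda msgs: "digest",
        system_prompt="Summarize.",
    )
    sizes.append(n)
```

Run it hourly for a week and the per-run footprint is identical on day seven to what it was on day one.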
Duplicates from retries/triggers

A couple of times I saw behavior consistent with “the same expensive work executed twice”:
- transient slowdowns causing retries
- duplicated triggers from chat integrations
One duplicated summarization run isn’t a rounding error when the prompt is already bloated.
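The standard guard here is an idempotency key: hash the task plus its payload and skip anything you’ve already accepted within a short window. A minimal sketch of that pattern (names and the 5-minute window are my choices, not from any particular framework):

```python
import hashlib
import time

class TriggerDeduper:
    """Drop triggers that repeat the same work within a short window."""

    def __init__(self, window_seconds=300):
        self.window = window_seconds
        self.seen = {}  # idempotency key -> timestamp of last accepted run

    def should_run(self, task_name, payload, now=None):
        now = time.time() if now is None else now
        key = hashlib.sha256(f"{task_name}:{payload}".encode()).hexdigest()
        last = self.seen.get(key)
        if last is not None and now - last < self.window:
            return False  # duplicate within the window: skip it
        self.seen[key] = now
        return True

dedup = TriggerDeduper()
first = dedup.should_run("summarize", "thread-42", now=100.0)
dupe = dedup.should_run("summarize", "thread-42", now=130.0)   # 30s later
later = dedup.should_run("summarize", "thread-42", now=500.0)  # window expired
```

This also makes retries safe by construction: a retry that fires inside the window is treated as the duplicate it is.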
So why did it hit $250 so fast? Because Claude was my default hammer for every nail, and I unintentionally designed the system to feed itself bigger and bigger inputs.
What fixed it (the boring, effective stuff)
- Hard caps on what gets summarized (smaller windows, tighter selection)
- Aggressive trimming of tool output (only keep what the next step truly needs)
- Removed screenshots unless strictly required
- Forced “fresh session” boundaries for scheduled jobs so context can’t grow forever
- Output length ceilings so digests can’t become essays
- De-duped triggers and made retries safer to avoid re-running the same job twice
- And the biggest one: stop using the most expensive model by default for routine steps
The part that pushed me into building something

After that first-day bill, the pattern was obvious: relying on discipline (“I’ll remember to switch models later”) doesn’t scale.
Claude was the immediate cost driver, so I took the routing model I’d built for Agentlify and adapted it into a custom routing layer specifically for OpenClaw: cheap/fast models for routine steps, only escalate to Claude when the task actually needs it. That became https://clawpane.co
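The core of that routing idea fits in a few lines. This is a sketch of the concept only, not ClawPane’s actual implementation; the task names, model names, and threshold are illustrative placeholders:

```python
def route_model(task, prompt):
    """Pick the cheapest model that can plausibly handle the step.

    Routine, well-bounded steps go to a cheap/fast model; escalate
    to the expensive one only when the task actually needs it.
    """
    ROUTINE = {"digest", "summarize", "extract_action_items", "alert"}
    if task in ROUTINE and len(prompt) < 8_000:
        return "cheap-fast-model"
    return "expensive-model"

# Routine steps stay cheap; big or unusual work escalates.
choice_a = route_model("summarize", "short thread text")
choice_b = route_model("refactor_codebase", "large diff text")
```

The defaults are inverted: the expensive model has to be earned by the task, instead of being the hammer for every nail.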
I’m not trying to sell anything here. The point isn’t “buy my thing.” The point is that routing stops being an optimization and becomes a seatbelt once you’ve had one day like this.
Takeaway

If you’re trialing agent workflows and your bill is spiking, it’s usually not one big request. It’s:
- context creep
- tool payloads piling up
- scheduled runs repeatedly paying warm-up overhead
- occasional duplicates
…all handled by an expensive default model doing work that doesn’t require it.
If you want, reply with what tasks you’re running and what your defaults look like. I’ll tell you where the spend usually hides.