I made a costly mistake with Claude Code: I let context accumulate across long sessions and relied on auto-compaction. What surprised me was that the output quality stayed good. I was happy with the results, and there was no obvious “something is wrong” signal. The only thing that went wrong was the bill.
What happened

- I kept iterating in the same long-lived context.
- Auto-compaction kicked in when needed.
- Sonnet was used by default most of the time.
- From a UX perspective, everything felt fine.
Why this is dangerous

- There’s no clear quality degradation to warn you.
- Token usage grows invisibly in the background (see the back-of-envelope sketch after this list).
- Auto-compaction itself consumes a lot of tokens, because the model has to re-read and summarize the whole history.
- You only realize something is wrong when you look at the invoice.
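To make “grows invisibly” concrete, here is a rough back-of-envelope sketch with purely hypothetical numbers: when every turn resends the entire history as input, billed input tokens grow roughly quadratically with the number of turns, even though the visible context only grows linearly.

```python
# Back-of-envelope: cumulative input tokens when every turn resends the whole
# conversation so far. All numbers are hypothetical, for illustration only.

TURNS = 50                # iterations in one long-lived session
TOKENS_PER_TURN = 2_000   # new prompt + response appended each turn (assumed)

context = 0               # tokens currently sitting in the conversation
total_input = 0           # tokens billed as input across the whole session

for _ in range(TURNS):
    total_input += context + TOKENS_PER_TURN  # the full history is re-read every turn
    context += TOKENS_PER_TURN

print(f"context at end:        {context:>9,} tokens")      # 100,000
print(f"cumulative input sent: {total_input:>9,} tokens")  # 2,550,000
```

Fifty turns of a modest 2k tokens each ends with a 100k-token context but over 2.5M input tokens billed, and none of that shows up in the chat UI.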
Root causes

- Long-lived context feels convenient, so you don’t reset it.
- Tooling doesn’t surface cost as a first-class signal.
- The mental model (“LLMs can remember my project”) is seductive but expensive.
- Defaulting to a large model makes the problem much worse.
What I changed

- One task = one fresh context.
- Externalized memory: project state lives in `context.md` / `decisions.md`, not in prompt history.
- Default to a smaller model; large models only for design/architecture.
- Diff-only outputs: no full file re-dumps.
- Disabled auto-compaction; summaries now live in docs.
- Added cost visibility: token counters and budget caps (a concrete sketch follows this list).
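To show what “one task = one fresh context” plus “cost visibility” looks like in practice, here is a minimal sketch assuming the Anthropic Python SDK. Claude Code manages all of this internally; the model name, file name, and budget numbers below are placeholders, not my exact setup.

```python
# Minimal sketch: fresh context per task, externalized memory, budget cap.
# Assumes the Anthropic Python SDK; model name, file name, and numbers are placeholders.
import anthropic

BUDGET_TOKENS = 200_000              # hard cap for a work session (assumed)
MODEL = "claude-3-5-haiku-latest"    # default to a smaller model (placeholder alias)

client = anthropic.Anthropic()       # reads ANTHROPIC_API_KEY from the environment
spent = {"input": 0, "output": 0}    # running token counters: the cost-visibility part

def run_task(task: str) -> str:
    """One task = one fresh context: only context.md is carried over, never chat history."""
    if spent["input"] + spent["output"] >= BUDGET_TOKENS:
        raise RuntimeError(f"Token budget exhausted: {spent}")

    with open("context.md") as f:    # externalized project memory instead of prompt history
        project_state = f.read()

    msg = client.messages.create(
        model=MODEL,
        max_tokens=2_000,
        system=f"Project state:\n{project_state}\n\nReply with diffs only, never full files.",
        messages=[{"role": "user", "content": task}],
    )

    # count every token the API reports, so cost shows up per call, not per invoice
    spent["input"] += msg.usage.input_tokens
    spent["output"] += msg.usage.output_tokens
    print(f"tokens so far: {spent}")
    return msg.content[0].text
```

The exact wrapper doesn’t matter; what matters is that the budget check runs before every call and the counters print after every call, so cost is a signal inside the loop rather than a surprise at the end of the month.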
Takeaway

This isn’t about output quality degrading. It’s about cost scaling quietly without feedback. LLMs make it too easy to accumulate invisible technical debt in tokens.
If tooling doesn’t make cost visible, people will keep doing this.