I’ve been building agents recently, and I hit a problem: I fell asleep while a script was running, and my agent got stuck in a loop. I woke up to a drained OpenAI credit balance.
I looked for a tool to prevent this, but most solutions were heavy enterprise proxies or cloud dashboards. I just wanted a simple "fuse" that runs on my laptop and stops the bleeding before it hits the API.
So I built AgentFuse.
It is a lightweight, local library that acts as a circuit breaker for LLM calls.
Drop-in Shim: It wraps the openai client (and supports LangChain) so you don't have to rewrite your agent logic.
Local State: It uses SQLite in WAL mode to track spend across multiple concurrent agents/terminal tabs.
Hard Limits: It enforces a daily budget (e.g., stops execution at $5.00).
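To make the "fuse" idea concrete, here's a minimal sketch of the wrapper pattern described above: a pre-flight budget check before every delegated call. This is illustrative only, not AgentFuse's actual API — the class and parameter names (BudgetedClient, daily_budget_usd, estimated_cost_usd) are my own for the example.

```python
class BudgetExceeded(RuntimeError):
    """Raised when a call would push spend past the daily cap."""


class BudgetedClient:
    """Sketch of a circuit-breaker shim: track cumulative spend and
    refuse any call that would exceed the daily budget."""

    def __init__(self, client, daily_budget_usd):
        self.client = client            # the real LLM client (callable here)
        self.daily_budget_usd = daily_budget_usd
        self.spent_usd = 0.0

    def call(self, estimated_cost_usd, *args, **kwargs):
        # Pre-flight check: fail fast *before* the API is hit.
        if self.spent_usd + estimated_cost_usd > self.daily_budget_usd:
            raise BudgetExceeded(
                f"spent ${self.spent_usd:.2f}; call would exceed "
                f"${self.daily_budget_usd:.2f} cap"
            )
        result = self.client(*args, **kwargs)
        self.spent_usd += estimated_cost_usd
        return result
```

Usage would look like wrapping your existing client once and leaving the rest of the agent loop untouched, e.g. `client = BudgetedClient(openai_call, daily_budget_usd=5.00)`.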
It’s open source and available on PyPI (pip install agent-fuse).
I’d love feedback on the implementation, specifically the SQLite concurrency logic! I tried to make it as robust as possible without needing a separate server process.
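For anyone curious what "SQLite in WAL mode as shared state" looks like in practice, here's a rough sketch of the pattern (my own simplification, not AgentFuse's actual schema or function names): one ledger file, WAL mode so concurrent readers don't block the writer, and each spend record written inside a transaction so totals stay consistent across terminal tabs.

```python
import sqlite3


def open_ledger(path):
    # WAL mode allows multiple processes to read while one writes,
    # which is what lets several agent tabs share one ledger file.
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute("CREATE TABLE IF NOT EXISTS spend (day TEXT, usd REAL)")
    return conn


def record_spend(conn, day, usd):
    # 'with conn' wraps the insert in a transaction, so concurrent
    # writers serialize cleanly instead of corrupting the total.
    with conn:
        conn.execute("INSERT INTO spend (day, usd) VALUES (?, ?)", (day, usd))


def total_spend(conn, day):
    (total,) = conn.execute(
        "SELECT COALESCE(SUM(usd), 0) FROM spend WHERE day = ?", (day,)
    ).fetchone()
    return total
```

The pre-flight check then reduces to comparing `total_spend(conn, today)` against the daily cap before each call.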
abdulbasitali•1h ago
The system runs a pre-flight check before every LLM call. If the call would push you over budget, it raises SentinelBudgetExceeded and halts the process immediately.
We don't have a specific "max retries" counter wired up yet, but I'll likely add that soon based on your feedback. For now, the budget cap would have caught it at $1.
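A max-retries counter of the kind suggested could be a second, independent fuse alongside the budget cap — something like this sketch (hypothetical names; nothing here is AgentFuse's API):

```python
class RetryLimitExceeded(RuntimeError):
    """Raised when the attempt counter trips, regardless of spend."""


def with_attempt_fuse(fn, max_attempts=10):
    """Wrap a callable so it raises after max_attempts invocations.

    This catches 'stuck in a loop' failures even when each attempt is
    cheap enough that the dollar budget alone would take hours to trip.
    """
    attempts = 0

    def guarded(*args, **kwargs):
        nonlocal attempts
        attempts += 1
        if attempts > max_attempts:
            raise RetryLimitExceeded(f"{max_attempts} attempts exhausted")
        return fn(*args, **kwargs)

    return guarded
```

The two fuses are complementary: the budget cap bounds total damage in dollars, while an attempt counter bounds it in wall-clock time for tight retry loops.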