Show HN: SatGate – An economic firewall for AI agent traffic

https://github.com/SatGate-io/satgate

1•satgate•1h ago

SatGate is an API gateway that enforces budgets on AI agent traffic. Open source (Go, Apache 2.0).

The problem: AI agents make API calls that cost money — LLM inference, tool calls, third-party services. Most setups have no hard spending limits. An agent loop or prompt injection can burn through hundreds of dollars before anyone notices. Rate limiting doesn't help because it doesn't understand cost.

SatGate sits in front of your agent's outbound calls and enforces economic policy:

• Hard budget caps — per-agent, per-tool, per-time-window. Not alerts, actual enforcement. The call gets rejected.

• Per-tool cost attribution — MCP-aware. Knows which tool in a chain caused what spend. Not just "1,000 requests" but "Agent X spent $47 on search_database and $12 on send_email."

• Macaroon capability tokens — cryptographic credentials with built-in caveats (budget, time window, allowed tools). Agents can sub-delegate scoped tokens without calling home. Not API keys.

• L402 Lightning micropayments — agents can pay for API access per-call using HTTP 402. Sub-cent pricing that doesn't work on card rails.

It's not a routing gateway. LiteLLM and Bifrost solve which provider handles a request. SatGate decides whether the request should happen at all given your budget constraints. They're complementary — SatGate sits in front of a routing gateway.

What it doesn't do: It doesn't optimize costs, negotiate rates, or pick cheaper providers. It's a policy enforcement layer, not an optimizer.

Single binary, 60-second quickstart, <50ms overhead.

GitHub: https://github.com/SatGate-io/satgate Blog: https://satgate.io/blog/why-routing-isnt-governance

Comments

satgate•1h ago

I've spent 27 years in enterprise cybersecurity — firewalls, IDS, access control, the usual stack. When I started running AI agents against production APIs last year, I had a familiar feeling: this looks exactly like the early internet before we figured out network security.

Agents make outbound calls with real dollar costs attached. The tooling to control that spend mostly comes down to "set an alert and hope someone's watching." I've seen agents in tight loops burn through $400 in minutes on tool calls nobody intended. One prompt injection away from draining a prepaid API balance.

The security stack has authentication, authorization, rate limiting — but nothing that understands cost as a first-class constraint. You can't express "this agent can spend $50/day across these tools" in a WAF rule.

So I built SatGate. It's a policy enforcement point for economic decisions. It reads cost metadata from MCP tool manifests, tracks cumulative spend per agent, and hard-blocks calls that would exceed budget.

We use macaroon tokens instead of API keys because they support attenuation — an agent can delegate a sub-token with tighter constraints without any server round-trip. A parent agent gives a child agent a token that says "you can spend $10 on search_database in the next hour." The child can't escalate.

The L402/Lightning piece came later — it turns out micropayments are a natural fit for agent-to-API commerce where you want per-call settlement without monthly invoices or API key management.

I looked at the existing landscape: Bifrost has soft budgets (alerts, no enforcement). Zuplo and Kong are solid API gateways but have no concept of economic controls. Nothing combined hard limits + per-tool costs + payments in one layer.

It's open source because I think this needs to be infrastructure, not a product. <50ms overhead, single Go binary, runs anywhere.

Happy to answer questions about the architecture, the macaroon auth model, or the problem space.

Attention Sinks and Compression Valleys in LLMs

AI Chat Evaluation of the Formal Language in He Xin's PEPC System 2

Hand tool rewrites ancient Egyptian history

A note about personal security

AI Chat Evaluation of the Formal Language in He Xin's PEPC System

A Note on File History in Emacs

Revisionist History – Aliens, Secrets and Conspiracies

Show HN: cbt (C++ Build Tool)

Open model StepFun-3.5 is #1 on MathArena, an uncheatable math benchmark

Show HN: Bitcoin, GEB, and Bach's fugues share the same structural move

Functional Programming in M4

AI makes it easier to build the wrong thing faster

Show HN: I built a macOS desktop toy that patrols while you work

Poison at Play: Unsafe lead levels found in half of New Orleans playgrounds

Unresponsive Buttons on My Fastest Hardware

AI-First Company Memos

How to Test ProxySQL Read/Write Split with Sysbench

The singularity won't be gentle – by Nate Silver

A New Computer Could Replace Electricity with Light

Show HN: Health.md - Apple Health → Markdown

PicoClaw: Ultra-Efficient AI Assistant in Go

AITools.coffee – GitHub metrics observatory tracking 27K+ open-source AI repos

AI Agents 101: From Concept to Code (No Frameworks Required)

Databases should contain their own Metadata – Use SQL Everywhere

Seeking Order in Chaos

Show HN: Funxy – A typed scripting language that embeds into Go apps

The jarring experience of developing today

Kiro: DeepSeek, MiniMax, and Qwen now available as open weight model options

Terence Tao: Why I Co-Founded SAIR

Maia 200: The AI accelerator built for inference