Like many AI-native applications, Fabricate supports a variety of LLMs (Sonnet, Haiku, Opus, etc.) for various agents and tasks. Users pay for usage by the token. We’d love to use Stripe Billing Meters to charge users by the token — I trust Stripe’s math more than my own when it comes to floating point arithmetic with very large numbers.
But Stripe caps subscriptions at 20 items. Each meter requires its own price, which in turn consumes one of those 20 item slots.
This becomes a problem fast with AI. Multiply 4-5 models by 5 token types (input, output, cached input, and ephemeral cache tiers) and you're already at 20-25 meters; adding even one more model exceeds the limit.
The workaround we settled on was to move the pricing logic entirely into Fabricate and report abstract "billing units" to Stripe instead of raw tokens.
The setup is pretty simple:
1. We created a single meter in Stripe with a fixed price of $0.00000001 per unit.
2. Internally, we assigned an integer multiplier to every model/token combination (e.g., Sonnet input might be 350 units per token, while Haiku cached input is 10).
3. At reporting time, we do the integer math: (tokens * multiplier) for each category, sum them up, and send that one aggregate number to Stripe.
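The steps above can be sketched in a few lines of Python. The multiplier values besides Sonnet input (350) and Haiku cached input (10) are invented for illustration, as is the `UNIT_MULTIPLIERS` table name; the real rates and reporting plumbing will differ.

```python
# Hypothetical multiplier table: integer "billing units" per token for each
# (model, token_type) pair. Only the Sonnet input (350) and Haiku cached
# input (10) figures come from the text; the rest are made up.
UNIT_MULTIPLIERS = {
    ("sonnet", "input"): 350,
    ("sonnet", "output"): 1750,
    ("haiku", "input"): 25,
    ("haiku", "cached_input"): 10,
}

def billing_units(usage: dict[tuple[str, str], int]) -> int:
    """Aggregate raw token counts into one integer unit total.

    `usage` maps (model, token_type) -> token count. Everything stays in
    integer math, so no floating-point rounding happens on our side before
    the number reaches Stripe.
    """
    return sum(tokens * UNIT_MULTIPLIERS[key] for key, tokens in usage.items())

# The aggregate then goes to the single Stripe meter as one event, roughly
# (assuming the standard Billing Meter Events API):
#
#   stripe.billing.MeterEvent.create(
#       event_name="billing_units",
#       payload={"stripe_customer_id": customer_id,
#                "value": str(billing_units(usage))},
#   )
```

For example, 1,000 Sonnet input tokens plus 5,000 Haiku cached input tokens come out to 1,000 × 350 + 5,000 × 10 = 400,000 units, which at $0.00000001 per unit bills as $0.004.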
This effectively decouples our model list from Stripe’s API constraints. We can add as many models or token tiers as we want without ever touching Stripe configuration again.
The obvious downside is that the Stripe dashboard just shows a massive number of "units" rather than a breakdown of what the customer actually used. We decided we didn't care. We have our own internal telemetry for usage analytics; Stripe’s only job is to multiply a number by a price and generate a valid invoice.
If you’re setting up usage-based billing with more than a few dimensions, I’d recommend abstracting your units from day one. It’s much easier than trying to migrate your billing architecture once you’ve already hit that 20-item ceiling. And of course, if you're Stripe (hi, Stripe!), I'd really recommend adding native support for this very popular use case.