I’m working on the system architecture for a high-throughput AdTech DSP and would love feedback from people who’ve built large-scale bidding / serving systems.

Constraints / Goals

DSP only (no exchange)

Target: 1M ad requests/sec

End-to-end DSP latency budget: ~100ms

Pricing model: CPM

Hard requirement: no advertiser or campaign overspend

Targeting / Campaign Fetch

I modeled targeting (geo, interests, etc.) using Redis + Roaring Bitmaps.
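To make the bitmap approach concrete, here is a minimal sketch of the idea. It assumes one bitmap per targeting attribute, with set bits representing campaign IDs. Plain Python integers stand in for Roaring bitmaps, and the `INDEX` contents and function names are hypothetical, not taken from the actual system. It also treats every attribute on the request as a required match (AND), which is a simplification.

```python
# Each targeting attribute maps to a bitmap whose set bits are campaign IDs.
# Plain Python ints stand in for Roaring bitmaps (assumption for illustration).

def bitmap_of(campaign_ids):
    """Build a bitmap with one bit set per campaign ID."""
    bits = 0
    for cid in campaign_ids:
        bits |= 1 << cid
    return bits

def ids_of(bits):
    """Return the campaign IDs whose bits are set, in ascending order."""
    ids = []
    cid = 0
    while bits:
        if bits & 1:
            ids.append(cid)
        bits >>= 1
        cid += 1
    return ids

# Hypothetical index: attribute value -> campaigns that target it.
INDEX = {
    ("geo", "US"): bitmap_of([1, 2, 5]),
    ("geo", "DE"): bitmap_of([3, 5]),
    ("interest", "sports"): bitmap_of([2, 3, 5]),
    ("interest", "travel"): bitmap_of([1, 4]),
}

def candidate_campaigns(request_attrs):
    """AND together the bitmaps for every attribute present on the request."""
    result = None
    for attr in request_attrs:
        bits = INDEX.get(attr, 0)  # unknown attribute matches no campaign
        result = bits if result is None else result & bits
    return ids_of(result or 0)

candidates = candidate_campaigns([("geo", "US"), ("interest", "sports")])
```

Here `candidates` contains the campaigns that target both US and sports. In the real setup the AND would run inside Redis or in the bidder process, and that placement is what drives the latency numbers.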

Fetching candidate campaigns alone:

Redis: ~1000 RPS at ~8ms (local machine, not cloud)

Aerospike: ~200–400 RPS at ~10ms

This is only campaign fetching, not bidding or scoring.
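A quick back-of-envelope on those figures, using Little's law (requests in flight = throughput × latency). The numbers are the ones quoted above, with the upper bound taken for Aerospike. The benchmark setup (connection count, pipelining) is not stated, so treat the interpretation as a guess.

```python
TARGET_RPS = 1_000_000
LATENCY_BUDGET_MS = 100

# (requests/sec, latency in ms) as quoted in the post; Aerospike uses the upper bound.
MEASURED = {
    "redis": (1000, 8),
    "aerospike": (400, 10),
}

def summarize(rps, latency_ms):
    """Derive concurrency, distance to target, and share of the latency budget."""
    return {
        # Little's law: average number of requests in flight at once.
        "in_flight": rps * latency_ms / 1000,
        # How many times more throughput the 1M/sec target needs.
        "throughput_gap": TARGET_RPS / rps,
        # Fraction of the ~100ms end-to-end budget spent on the fetch alone.
        "budget_share": latency_ms / LATENCY_BUDGET_MS,
    }

SUMMARY = {name: summarize(*m) for name, m in MEASURED.items()}
```

With these inputs the Redis run averages about 8 requests in flight and sits roughly 1000x below the target. That may mean the benchmark client's concurrency, rather than the store itself, was the limiting factor.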

Budget / Wallet Model

Advertiser has a wallet

Campaign has:

Total budget

Daily budget

Daily spend tracking

Overspend is not acceptable (even a small % matters at scale).

Budget Control Approaches Considered

Splitting daily budgets into hourly buckets

Rate limiting via:

Token bucket

PID controllers

These reduce overspend but don’t guarantee correctness under bursty traffic.
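For reference, this is roughly what the token bucket variant looks like. It is a minimal single-process sketch, assuming tokens are integer micros of budget and time is passed in as integer milliseconds. The class and parameter names are hypothetical.

```python
class TokenBucket:
    """Spend pacing: tokens are micros of budget, refilled at a steady rate."""

    def __init__(self, rate_micros_per_ms, capacity_micros, now_ms=0):
        self.rate = rate_micros_per_ms    # refill speed
        self.capacity = capacity_micros   # maximum burst the bucket allows
        self.tokens = capacity_micros     # start full
        self.last_ms = now_ms

    def try_spend(self, cost_micros, now_ms):
        """Refill for the elapsed time, then spend only if enough tokens remain."""
        elapsed = max(0, now_ms - self.last_ms)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last_ms = max(self.last_ms, now_ms)
        if cost_micros <= self.tokens:
            self.tokens -= cost_micros
            return True
        return False

# 1000 micros per ms, with bursts of up to 5000 micros.
bucket = TokenBucket(rate_micros_per_ms=1000, capacity_micros=5000)
results = [
    bucket.try_spend(3000, now_ms=0),  # allowed, 2000 tokens left
    bucket.try_spend(3000, now_ms=0),  # rejected, only 2000 tokens
    bucket.try_spend(3000, now_ms=1),  # allowed after 1 ms of refill
]
```

This bounds spend only within one process. With many bidders each holding its own bucket, the aggregate is only as correct as the way the rate is divided among them, which is where bursty traffic causes trouble.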

I'm also now considering storing all amounts in micros (integer currency units) to avoid floating-point rounding errors.
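A small sketch of what the micros idea buys, assuming 1 currency unit = 1,000,000 micros and CPM meaning price per 1000 impressions. The helper names and sample amounts are made up for illustration.

```python
from decimal import Decimal

MICROS_PER_UNIT = 1_000_000

def to_micros(amount_str):
    """Parse a decimal string such as '2.50' into integer micros, avoiding floats."""
    return int(Decimal(amount_str) * MICROS_PER_UNIT)

def impression_cost_micros(cpm_micros):
    """Cost of one impression at a given CPM; floor division drops any sub-micro remainder."""
    return cpm_micros // 1000

# Floats drift on simple sums; integer micros do not.
float_sum_is_exact = (0.1 + 0.2 == 0.3)
micros_sum_is_exact = (to_micros("0.1") + to_micros("0.2") == to_micros("0.3"))

daily_budget = to_micros("10.00")                 # 10_000_000 micros
cost = impression_cost_micros(to_micros("2.50"))  # micros per impression
affordable_impressions = daily_budget // cost
```

Integer micros make accounting exact and comparisons safe, but they only address rounding. They do nothing for the concurrency side of overspend.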

Open Questions

At 1M QPS, how do people actually enforce budget guarantees in production?

Soft overspend with reconciliation?

Hard atomic checks in the hot path?
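To be clear about what I mean by a hard atomic check: a check-and-decrement that either reserves the full cost or refuses. Below is a minimal in-process sketch using a lock. `BudgetCounter` and `run_demo` are hypothetical names. In a distributed setup the same step would have to run atomically in a shared store (for example as a server-side script), and that is exactly the contention I'm worried about.

```python
import threading

class BudgetCounter:
    """Remaining budget in micros with an atomic check-and-decrement."""

    def __init__(self, budget_micros):
        self._remaining = budget_micros
        self._lock = threading.Lock()

    def try_reserve(self, cost_micros):
        """Reserve the cost only if it fits; the check and the update happen under one lock."""
        with self._lock:
            if cost_micros > self._remaining:
                return False
            self._remaining -= cost_micros
            return True

    @property
    def remaining(self):
        with self._lock:
            return self._remaining

def run_demo(budget_micros=1_000_000, cost_micros=2_500, threads=8, attempts=100):
    """Hammer one counter from several threads; return (total spent, remaining)."""
    counter = BudgetCounter(budget_micros)
    wins = []

    def worker():
        won = 0
        for _ in range(attempts):
            if counter.try_reserve(cost_micros):
                won += 1
        wins.append(won)

    workers = [threading.Thread(target=worker) for _ in range(threads)]
    for t in workers:
        t.start()
    for t in workers:
        t.join()
    return sum(wins) * cost_micros, counter.remaining

spent, remaining = run_demo()
```

With 800 attempts against a budget that covers 400, the spend stops exactly at the budget. The cost is that every reservation serializes on one lock, which is fine in one process and painful across regions.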

Is Redis bitmap–based targeting viable at this scale, or does everyone eventually:

Pre-materialize campaign sets?

Push logic into memory / C++?

How do you balance the following without introducing global locks or cross-region contention?

Strict budget enforcement

Low latency

High throughput

Is “no overspend ever” a realistic requirement, or is bounded error the industry norm?

I’m less interested in textbook answers and more in what has actually worked (or failed) in production.
