The failure modes are well-understood: a key gets rate-limited, you wait, you try the next one. Billing errors need a longer cooldown than rate limits. This is not a distributed systems problem — it's a state machine that fits in a library. The problem is everyone keeps solving it with infrastructure instead: spin up LiteLLM, now you have a Python service to maintain; reach for Redis, now you have a database for a problem that doesn't need one.

key-carousel manages a pool of API key profiles with exponential-backoff cooldowns: 1min → 5min → 25min → 1hr for rate limits, 5hr → 24hr for billing. Falls back to OpenAI or Gemini when Anthropic keys are exhausted. Optional file persistence. Zero dependencies.
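For the "state machine that fits in a library" claim, here's a minimal sketch of that cooldown machine in Python — class and method names are my own invention, not key-carousel's actual API, and the cooldown schedules are taken from the numbers above:

```python
import time

# Cooldown schedules from the post, in seconds.
RATE_LIMIT_COOLDOWNS = [60, 300, 1500, 3600]   # 1min -> 5min -> 25min -> 1hr
BILLING_COOLDOWNS = [5 * 3600, 24 * 3600]      # 5hr -> 24hr

class KeyPool:
    """Hypothetical key pool: not key-carousel's real interface."""

    def __init__(self, keys, clock=time.monotonic):
        self._clock = clock
        # Per-key state: when the cooldown ends, and how many strikes so far.
        self._state = {k: {"until": 0.0, "strikes": 0} for k in keys}

    def acquire(self):
        """Return the first key not currently cooling down, or None."""
        now = self._clock()
        for key, st in self._state.items():
            if now >= st["until"]:
                return key
        return None

    def report_failure(self, key, kind):
        """Start a backoff cooldown; kind is 'rate_limit' or 'billing'."""
        schedule = RATE_LIMIT_COOLDOWNS if kind == "rate_limit" else BILLING_COOLDOWNS
        st = self._state[key]
        idx = min(st["strikes"], len(schedule) - 1)  # cap at the longest cooldown
        st["until"] = self._clock() + schedule[idx]
        st["strikes"] += 1

    def report_success(self, key):
        """A healthy key resets its backoff ladder."""
        self._state[key]["strikes"] = 0
```

An injected clock makes the whole thing testable without sleeping, which is most of the argument for keeping it a library rather than a service.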