The provider returned 429s for a short period. We had per-call retry limits in place. We did NOT have containment at the request-chain level.
Because calls were nested and spread across multiple workers, each layer retried independently: one upstream 429 fanned out into a multiplicative storm of downstream retries that we didn't anticipate.
Per-call limits were not enough.
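For context, what I mean by chain-level containment is roughly this (a minimal sketch, all names hypothetical, not what we run): a single retry budget created at the chain's entry point and passed through every nested call, so retries anywhere in the chain draw from one shared pool instead of multiplying per call.

```python
import threading


class ChainRetryBudget:
    """Shared retry budget for one request chain.

    Created once at the chain entry point and threaded through
    nested calls, so the whole chain gets N retries total,
    regardless of how deeply calls are nested.
    """

    def __init__(self, max_retries: int = 5):
        self._remaining = max_retries
        self._lock = threading.Lock()  # the chain may span worker threads

    def try_consume(self) -> bool:
        """Return True if a retry is still allowed, else False."""
        with self._lock:
            if self._remaining > 0:
                self._remaining -= 1
                return True
            return False


def call_with_budget(fn, budget: ChainRetryBudget):
    """Call fn(), retrying only while the chain-wide budget allows."""
    while True:
        try:
            return fn()
        except Exception:
            if not budget.try_consume():
                raise  # budget exhausted: fail the whole chain fast
```

The point is that per-call limits compose multiplicatively across nesting levels, while one shared budget caps the total work the chain can generate.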
For those running LLM systems in production:
– Do you implement chain-level retry budgets?
– Shared circuit breaker state?
– Per-minute cost ceilings?
– Cost-based limits (tokens/$) rather than retry count?
– Or is exponential backoff usually sufficient in practice?
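To make the cost-ceiling question concrete, this is the kind of thing I'm imagining (a rough sketch, names made up, not something we run): a trailing-window dollar ceiling checked before each provider call, so retries are bounded by spend rather than by attempt count.

```python
import time
from collections import deque


class CostCeiling:
    """Trailing-window spend ceiling (hypothetical sketch).

    Records the cost of each completed call; rejects new calls once
    spend in the trailing window reaches the ceiling. Retries are
    then bounded by dollars, not by a per-call retry counter.
    """

    def __init__(self, dollars_per_minute: float,
                 window_s: float = 60.0, clock=time.monotonic):
        self.ceiling = dollars_per_minute
        self.window_s = window_s
        self.clock = clock  # injectable for testing
        self._events = deque()  # (timestamp, cost) pairs
        self._total = 0.0

    def _evict(self, now: float) -> None:
        # Drop spend that has aged out of the trailing window.
        while self._events and now - self._events[0][0] > self.window_s:
            _, cost = self._events.popleft()
            self._total -= cost

    def allow(self) -> bool:
        """Check before issuing a call (or a retry)."""
        self._evict(self.clock())
        return self._total < self.ceiling

    def record(self, cost: float) -> None:
        """Record the cost of a completed call."""
        now = self.clock()
        self._evict(now)
        self._events.append((now, cost))
        self._total += cost
```

One nice property of a spend-based limit is that it is shared state by construction: expensive retries anywhere in the system eat the same budget, so a retry storm self-throttles even when no single call looks abnormal.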
I’m trying to understand what actually works at scale, beyond monitoring dashboards that tell you after the fact.