frontpage.

We’ve been pushing LLM backed workflows into production and are starting to run into reliability edges that observability alone doesn’t solve.

Things like:

- loops that don’t terminate cleanly

- retries cascading across tool calls

- cost creeping up inside a single workflow

- agents making technically “allowed” but undesirable calls

Monitoring here is fine. We can see what’s happening. The harder part is deciding where the enforcement boundary actually lives.

Right now, most of our shutdown paths still feel manual, things like feature flags, revoking keys, rate limiting upstream, etc.

Curious how others are handling these problems in practice:

- What’s your enforcement unit? Tool call, workflow, container, something else?

- Do you have automated kill conditions?

- Did you build this layer internally?

- Did you have to revisit it multiple times as complexity increased?

- Does it get worse as workflows span more tools or services?

Would appreciate any concrete experiences from teams running agents in production. Really just trying to figure out how to scale.

A Day in the Life of an Enshittificator [video]

Show HN: BridgeBase – one control plane for TigerBeetle,Redis,MySQL,ClickHouse

Guidelines for Writing Cryptography Specifications

AI is creeping into election campaigns. NZ's rules aren't ready

White House stalls release of approved US science budgets

Show HN: Open-Source Postman for MCP

" I've got the guns," is a wild government argument for tech pundits to support

Cavaro – One platform for design docs and architecture diagrams

Astro and Svelte: Why I believe they're the future of web development

Bounded Plasticity Simulation

Clueless cops post seized crypto wallet password. $5M quickly stolen

AI First Application Development

The world wants to ban children from social media with grave consequences

Show HN: WaitQ – queue management and waitlist system

How to Build Your Own Quantum Computer

AI Architecture Pattern Manager – Togaf ABB/SBB/PBC with Neo4J

Slop Definitions Were My Final Straw with Google Search

Vouch

Musk's fossil data centres are undoing Tesla's climate benefit

Exploiting Iran: A Political Timeline

Poll: Which VCs Are Tier 1?

Nobody ever got fired for using a struct

Origin of the Abbreviation i18n

Fast Biology Bounties

Vibe Theory: Mathematical Derivation of Aesthetic Vibe from Text

2x Qwen 3.5 on M1 Mac: 9B builds a bot, 0.8B runs it

Five People in Their 60s, 70s, and 80s Share How They Plan to Age at Home

Show HN: AgentBrowser Token-efficient browser for AI agents via ASCII wireframes

Microsoft Creative Writer (1993)

Show HN: Parallax – Coordinate adversarial AI agents over durable streams

Ask HN: How are you preventing runaway LLM workflows in production?