Been running an autonomous Claude agent for a month (makes its own decisions between heartbeats, spawns subagents, uses tools). Prompt-level guardrails keep failing: "never delete more than 10 records" works until context gets long or an edge case hits, then the model ignores it.

Approaches I've tried or seen:
- Separate validation layer before tool execution
- Hard-coded pre/post conditions in the tool wrapper
- Secondary model auditing planned actions before they run

The secondary-model approach doubles costs. Tool wrappers work but need defensive code for every tool.
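
For concreteness, the tool-wrapper version I mean looks roughly like the sketch below. The names (`guarded`, `delete_limit`, `delete_records`, `GuardrailViolation`) are made up for illustration and the database call is stubbed out, so treat it as a rough shape, not a real implementation. The point is that the limit lives in code the model can't talk its way around, at the cost of one precondition function per tool.

```python
import functools

MAX_DELETE = 10  # the "never delete more than 10 records" rule, enforced in code


class GuardrailViolation(Exception):
    """Raised when a tool call fails a hard-coded precondition."""


def guarded(precondition):
    """Wrap a tool so the precondition runs before the tool body does.

    The precondition takes the same arguments as the tool and returns
    an error string to block the call, or None to allow it.
    """
    def decorator(tool):
        @functools.wraps(tool)
        def wrapper(*args, **kwargs):
            error = precondition(*args, **kwargs)
            if error is not None:
                raise GuardrailViolation(f"{tool.__name__}: {error}")
            return tool(*args, **kwargs)
        return wrapper
    return decorator


def delete_limit(record_ids):
    """Hypothetical precondition: block bulk deletes above MAX_DELETE."""
    if len(record_ids) > MAX_DELETE:
        return f"refusing to delete {len(record_ids)} records (limit {MAX_DELETE})"
    return None


@guarded(delete_limit)
def delete_records(record_ids):
    # Stand-in for the real database call; returns the number of rows deleted.
    return len(record_ids)
```

The agent only ever sees `delete_records`; a call with 11 or more IDs raises `GuardrailViolation` before anything touches the database, and the error text can be fed back to the model as the tool result.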

What's actually working in production? Specifically for agents that write to databases, send emails, or call APIs where mistakes are hard to undo.

Ask HN: How do you enforce guardrails on Claude agents taking real actions?
