frontpage.

I built this over the weekend because agent systems seem to jump from static guardrails to much heavier judge-style loops, with not much in between.

AgentUQ uses provider logprobs to localize brittle action-bearing spans in an agent step and route actions like continue, retry, verify, ask for confirmation, or block.

The claim is intentionally narrow. It isn’t trying to determine truth. I know uncertainty work can feel theoretical, so I wanted to test whether a smaller operational use of it could be useful in practice.

My bigger belief is that agents need infrastructure that learns from production history (failed runs as well as unconfident ones) instead of just accumulating patches. This is one concrete experiment in that direction.

Tommy DeCarlo, Boston Fan Who Became Their Lead Singer, Dead at 60

The Bay Area Considers the Unthinkable: Life Without BART

A Methodological Critique of "First Proof" (Abouzaid et al., 2026)

Umbra Open Data Tracker

Show HN: A tool for arranging photos for home-printing without wasting paper

I've never parented a 6-year-old child. But I've dealt with macOS system updates

Rising Air-Conditioning Use Intensifies Global Warming

Exigy Shareware Construction Kit

Every paper published in the Bridges Conference on art and mathematics

I built a RabbtiMQ UI alternative because its not 2005 anymore

How God Got So Great

Texas women used crow drones to fly drugs into Louisiana prison

Local-First CI/CD with Makefiles

I'm 21 and My Startup Scalify.ai Is Going Viral

Show HN: RunAnwhere – Faster AI Inference on Apple Silicon

"This guy's arrogance takes your breath away" (2016)

The evolution of Mac app window corners

Gateproof: Build Software in Reverse

Philippines shifts to four-day work week as Iran war pushes oil prices up

Kung Fu: The Way of Violence Has No Mind [video]

Team simulates a living cell that grows and divides

Isotopic Evidence for a Cold and Distant Origin of Interstellar Object 3I/Atlas

Defeat as Method

Ad-tech is fascist tech

Berth – One-command deploys for AI-generated code (no Docker, no YAML)

Writing an LLM from scratch, part 32e – Interventions: the learning rate

Microsoft Copilot Update Hijacks Default Browser Links

Is Music Just Sound?

Starting to building an open-source tool to track how AI agents search the web

A funny game:Guesss AI or human?

Show HN: AgentUQ, a token-logprob runtime gate for LLM agents