As humans, we rely on visual diffs for that. We open them, scan quickly, and catch obvious regressions. Agents are completely out of that loop.
I’m a co-founder of Argos (visual testing), and I recently shipped a CLI to expose visual diffs in a way an agent can actually use, instead of going through a UI.
Once I wired it into an agent workflow, a few interesting things happened. The agent caught obvious regressions. Sometimes it refused to approve its own PR. With a good prompt, it even fixed the issue after seeing the diff and iterating.
It’s still rough and not reliable enough to trust on its own. A lot depends on how well the agent understands the codebase. In local tests, it sometimes gets stuck in loops and burns through tokens.
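To make that loop concrete, here's a minimal sketch of the check-and-fix cycle with an attempt budget to cap the token burn. Everything here is hypothetical: `run_visual_diff` and `agent_fix` are stand-ins for the real CLI invocation and agent call, not actual Argos APIs.

```python
MAX_ATTEMPTS = 3  # budget so a stuck agent doesn't loop forever

def run_visual_diff(state):
    # Stand-in for invoking the diff CLI; returns the screenshots that changed.
    return [s for s in state["screens"] if s["changed"]]

def agent_fix(state, diffs):
    # Stand-in for the agent seeing the diff images and pushing a fix.
    for d in diffs:
        d["changed"] = False  # pretend the fix resolved the regression

def review_loop(state):
    # Check, fix, re-check; approve only when the visual diff is clean.
    for attempt in range(1, MAX_ATTEMPTS + 1):
        diffs = run_visual_diff(state)
        if not diffs:
            return f"approved after {attempt} check(s)"
        agent_fix(state, diffs)
    return "gave up: diffs still present"

state = {"screens": [{"name": "home", "changed": True},
                     {"name": "settings", "changed": False}]}
print(review_loop(state))  # → approved after 2 check(s)
```

The attempt budget is the important part in practice: without it, an agent that misdiagnoses the diff will keep re-rendering and re-prompting indefinitely.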
Giving agents “eyes” on UI changes might be an interesting feedback loop for more autonomous dev agents in the future.