
aft was my stab at a way to understand what Claude is doing, and at a language for reasoning about differences in model behavior when we make models do long agentic runs, change prompts, alter tools, etc. The intention of the toolkit is to provide an empirical measure of how agent behavior differs as things like environments, tools, and prompts change.

It gives you the tools to measure changes in "behaviors that the users define". This makes it more of a hypothesis-testing framework for what the agent is doing than something that tells you what the agent might do.
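To make the "hypothesis testing over user-defined behaviors" idea concrete, here is a minimal sketch in plain Python. It is not aft's API: the `Run` type, the `reads_before_editing` behavior, and the tool names `read_file` / `edit_file` are all made up for illustration, and the permutation test is just one generic way to ask whether a behavior's rate differs between two conditions (say, an old prompt versus a new one).

```python
import random
from typing import Callable, List, Sequence

# Hypothetical representation: one agent run as an ordered list of tool-call names.
Run = List[str]


def reads_before_editing(run: Run) -> bool:
    """Hypothetical user-defined behavior: a read happens before the first edit."""
    if "edit_file" not in run:
        return False
    return "read_file" in run[: run.index("edit_file")]


def behavior_rate(runs: Sequence[Run], behavior: Callable[[Run], bool]) -> float:
    """Fraction of runs in which the behavior was observed."""
    return sum(behavior(r) for r in runs) / len(runs)


def permutation_p_value(
    runs_a: Sequence[Run],
    runs_b: Sequence[Run],
    behavior: Callable[[Run], bool],
    n_resamples: int = 10_000,
    seed: int = 0,
) -> float:
    """Two-sided permutation test on the difference in behavior rates."""
    rng = random.Random(seed)
    labels = [behavior(r) for r in runs_a] + [behavior(r) for r in runs_b]
    n_a, n_b = len(runs_a), len(runs_b)

    def rate_gap(xs: List[bool]) -> float:
        return abs(sum(xs[:n_a]) / n_a - sum(xs[n_a:]) / n_b)

    observed = rate_gap(labels)
    hits = 0
    for _ in range(n_resamples):
        rng.shuffle(labels)  # break any link between condition and behavior
        if rate_gap(labels) >= observed - 1e-12:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_resamples + 1)
```

With runs collected under two conditions, `behavior_rate` gives the per-condition rate and `permutation_p_value` says how surprising the gap would be if the condition made no difference. The hypothesis ("the new prompt makes the agent read before editing more often") comes from the user; the code only tests it.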

The reasoning and derivations behind these tools are given here: https://technoyoda.github.io/agent-science.html

Would be very happy to hear feedback and questions. (Please ignore the names given to the theorization; they were for shits and giggles.)


Show HN: Aft, a Python toolkit to study agent behavior