KarnEvil9 is a TypeScript runtime that implements the DeepMind delegation paper from earlier this year (Tomasev et al.). The core idea is pretty simple: every action goes into a SHA-256 hash-chain journal, agents earn trust through a Bayesian scoring model, and there are actual economic stakes via escrow bonds. If an agent screws up, it loses its bond. If it keeps failing, the futility monitor kills the loop.
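For anyone curious what the hash-chain journal looks like mechanically, here's a minimal sketch in TypeScript. All names here are mine, not KarnEvil9's actual API; the point is just that each entry's SHA-256 hash covers the previous entry's hash, so any tampering breaks every link after it:

```typescript
import { createHash } from "node:crypto";

interface JournalEntry {
  action: string;
  prevHash: string;
  hash: string;
}

class HashChainJournal {
  private entries: JournalEntry[] = [];

  append(action: string): JournalEntry {
    // Genesis entries chain off a fixed all-zero hash.
    const prevHash = this.entries.length
      ? this.entries[this.entries.length - 1].hash
      : "0".repeat(64);
    const hash = createHash("sha256")
      .update(prevHash + action)
      .digest("hex");
    const entry = { action, prevHash, hash };
    this.entries.push(entry);
    return entry;
  }

  // Recompute every link from genesis; false if any entry was altered.
  verify(): boolean {
    let prev = "0".repeat(64);
    for (const e of this.entries) {
      const expected = createHash("sha256")
        .update(prev + e.action)
        .digest("hex");
      if (e.prevHash !== prev || e.hash !== expected) return false;
      prev = e.hash;
    }
    return true;
  }
}
```

The real journal presumably records richer entries (agent id, timestamps, outcomes), but the chaining trick is the same.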
The fun part was testing it on Zork I. I set up three agents in a swarm: one plans moves, one executes them against a Z-machine, and one independently verifies game state. The governance layer immediately blocked the agent from attacking the troll because it classified "attack" as high-risk. It took me a while to realize the fix wasn't to whitelist attack commands but to make the system trust-aware, so an agent with a good track record can take riskier actions.
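Here's roughly what I mean by trust-aware gating, as a sketch (again, my naming and my risk numbers, not the project's). Trust is a Beta-posterior mean over the agent's past successes and failures, and an action is allowed only when trust clears that action's risk threshold:

```typescript
interface AgentRecord {
  successes: number;
  failures: number;
}

// Bayesian trust score with a Beta(1,1) prior:
// mean = (successes + 1) / (successes + failures + 2).
// A fresh agent starts at 0.5 and earns trust with each success.
function trustScore(a: AgentRecord): number {
  return (a.successes + 1) / (a.successes + a.failures + 2);
}

// Illustrative risk thresholds for Zork-style commands.
const RISK: Record<string, number> = {
  look: 0.1,
  take: 0.3,
  attack: 0.8, // high-risk: blocked until trust is earned
};

function allowed(a: AgentRecord, action: string): boolean {
  // Unknown actions default to a middling threshold.
  return trustScore(a) >= (RISK[action] ?? 0.5);
}
```

Under this scheme a fresh agent (trust 0.5) can look and take but can't attack, while an agent with 20 successes and 1 failure (trust ≈ 0.91) clears the 0.8 threshold.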
The other thing I didn't expect: when Eddie (the autonomous agent that runs 24/7 on this) hit the Anthropic API credit wall, the futility monitor halted everything, and Eddie's next plan included switching to cheaper models for routine code reviews. Nobody told it to optimize costs. That came out of the delegation framework's cost-awareness primitives.
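The futility monitor itself is conceptually tiny. This is not Eddie's actual code, just the shape of the idea: count consecutive iterations with no progress and halt the loop when the count hits a limit:

```typescript
// Halts an agent loop after maxStalls consecutive no-progress iterations.
class FutilityMonitor {
  private stalls = 0;

  constructor(private readonly maxStalls: number) {}

  // Call once per iteration; returns true when the loop should halt.
  record(madeProgress: boolean): boolean {
    this.stalls = madeProgress ? 0 : this.stalls + 1;
    return this.stalls >= this.maxStalls;
  }
}
```

An API credit wall shows up as a run of failed (no-progress) iterations, which is why it tripped the monitor rather than burning retries forever.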
Happy to answer questions. https://oldeucryptoboi.com