
We tested 6 frontier models across 17,420 tool-call interactions and found that models consistently refuse harmful requests in text while executing them through tool calls. We call this divergence the GAP metric. The text says no. The tool call says yes. Edictum is a runtime governance library that enforces safety contracts at the tool-call boundary: the point where you have the tool name, the arguments, and the ability to block before execution. Contracts are written in YAML and cover preconditions, postconditions, and PII redaction. Decisions are deterministic (allow, deny, or redact), with no LLM in the loop. It has zero runtime dependencies, takes 55μs per evaluation, and works with LangChain, CrewAI, OpenAI Agents SDK, Claude Agent SDK, Agno, Semantic Kernel, and nanobot. MIT licensed.
Paper: https://arxiv.org/abs/2602.16943
GitHub: https://github.com/acartag7/edictum
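To make the idea of a deterministic check at the tool-call boundary concrete, here is a minimal sketch in plain Python. It is not Edictum's API: the names `Decision`, `RULES`, `check_precondition`, `redact`, and `guarded_call` are hypothetical, the rules are hard-coded rather than loaded from YAML, and the email regex stands in for real PII detection. It only illustrates the shape of the technique: inspect the tool name and arguments before execution, then redact the output afterwards.

```python
import re
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Decision:
    action: str  # "allow" or "deny"
    reason: str = ""


# Hypothetical contract rules: (tool name, predicate that is True on a violation, reason).
Rule = tuple[str, Callable[[dict[str, Any]], bool], str]

RULES: list[Rule] = [
    ("read_file",
     lambda a: str(a.get("path", "")).startswith("/etc/"),
     "reads under /etc are blocked"),
    ("run_shell",
     lambda a: "rm -rf" in str(a.get("command", "")),
     "destructive shell command"),
]

# Illustrative stand-in for PII detection: matches simple email addresses only.
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def check_precondition(tool: str, args: dict[str, Any]) -> Decision:
    """Evaluate the rules against a proposed tool call, before it runs."""
    for name, violates, reason in RULES:
        if name == tool and violates(args):
            return Decision("deny", reason)
    return Decision("allow")


def redact(output: str) -> str:
    """Postcondition: mask email addresses in the tool's output."""
    return EMAIL.sub("[REDACTED_EMAIL]", output)


def guarded_call(tool: str, args: dict[str, Any],
                 execute: Callable[[str, dict[str, Any]], str]) -> str:
    """Run `execute` only if the precondition allows it, then redact the result."""
    decision = check_precondition(tool, args)
    if decision.action == "deny":
        return f"DENIED: {decision.reason}"
    return redact(execute(tool, args))
```

Because the check runs on the structured call rather than on the model's text, a denied call never reaches `execute`, regardless of what the model said in its reply.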

The geomechanics of hydrogen storage in salt caverns [pdf]

How to make LLM native User Interfaces - Post LLM Workflow

Two Beliefs About Coding Agents: Devs Don't Realize What They Bring

Why are you still using Vercel?

Your Move, Claude

Bioethics Was Forged in Horror. It Can Be Lost the Same Way

Show HN: ZSE – Open-source LLM inference engine with 3.9s cold starts

Show HN: Taji – Portfolio advisor that's better than Fidelity's

Therapist's Office Is Designed to Make You Cry

The Texas AI boom is outpacing water regulations

I built a client portal for freelancers after the same conversation arose

Against Query Based Compilers

Discovering Multiagent Learning Algorithms with Large Language Models

Postgres Jsonb Columns and Toast: A Performance Guide

(paper money) Hedge Fund staffed by AI Employees (experiment)

Show HN: Bloomfilter – A service for AI agents to register and manage domains

Examining Bias and AI in Latin America

Show HN: WebMCP Core – AI agent tool definitions from any site

Tell HN: Cursor has an agent CLI, and it's better than Claude Code

Anthropic is dropping its signature safety pledge amid a heated AI race

Eleven Freedoms for Free AI

Average Typing Speeds based on 221k user typing sessions

WTF Happened in 2025?

Dead Internet Theory – A Win?

Open-Source Agent Operating System

RAG on a Budget: How I Replaced a $360/Month OpenSearch Cluster for $1.12/Month

Tech Companies Shouldn't Be Bullied into Doing Surveillance

Honey Fraud as a Moving Analytical Target: Omics-Informed Authentication

Claude Code Video Toolkit

Show HN: Unix for the Commodore 64? Open Source

Show HN: Edictum – Runtime governance for LLM agent tool calls