Each skill is a structured SKILL.md. After every run, an LLM judge scores each skill and tags exact failures. An LLM patcher generates candidate fixes to just the failing section. Each candidate is replayed on past traces. Winner gets promoted. Loser discarded.
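A minimal sketch of that judge, patch, replay, promote loop. All names here are illustrative stand-ins, not the actual evoagents API; the toy judge below is a substring check standing in for an LLM call.

```python
# Hypothetical sketch of the replay-gated promote/discard step;
# names are illustrative, not the real evoagents internals.
from dataclasses import dataclass

@dataclass
class Verdict:
    passed: bool
    section: str  # which SKILL.md section failed


def score(skill: str, traces: list, judge) -> float:
    """Fraction of past traces the skill passes under the judge."""
    verdicts = [judge(skill, t) for t in traces]
    return sum(v.passed for v in verdicts) / len(traces)


def gate(current: str, candidate: str, traces: list, judge) -> str:
    """Replay gate: promote the candidate patch only if it beats the
    current version on real past traces; otherwise discard it."""
    if score(candidate, traces, judge) > score(current, traces, judge):
        return candidate   # winner: promoted as the new version
    return current         # loser: discarded, current version stays


# Toy stand-in for the LLM judge: pass if the skill covers the trace's need.
def toy_judge(skill, trace):
    return Verdict(passed=trace in skill, section="instructions")


traces = ["cite sources", "primary sources"]
current = "Always cite sources."
candidate = "Always cite sources; prefer primary sources when available."
print(gate(current, candidate, traces, toy_judge))  # candidate wins replay
```

The point of the gate is that a patch never ships on the patcher's say-so alone: it has to outscore the incumbent on the same historical traces.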
One command: evoagents autofix
Key decisions:
- LLM-as-judge, not regex — constraints are natural language, evaluation should be too
- Section-level patching — only the broken part gets touched
- Replay gating — no patch ships without proving it improves on real data
- Versioned — every change is a new version, instant rollback
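The last decision, every change is a new version with instant rollback, could look roughly like this. This is a hypothetical sketch, not the actual evoagents storage layer:

```python
# Illustrative append-only version history for one SKILL.md;
# not the real evoagents implementation.
class SkillVersions:
    def __init__(self, initial: str):
        self.versions = [initial]  # version 0 is the original skill

    @property
    def current(self) -> str:
        return self.versions[-1]

    def promote(self, patched: str) -> int:
        """A patch that survives the replay gate becomes a new version;
        every earlier version is kept."""
        self.versions.append(patched)
        return len(self.versions) - 1  # new version number

    def rollback(self, to: int) -> str:
        """Instant rollback: re-promote an earlier version as-is."""
        self.versions.append(self.versions[to])
        return self.current


skill = SkillVersions("v0: draft instructions")
skill.promote("v1: patched instructions")
skill.rollback(0)
print(skill.current)  # back to the original text
```

Rollback here is itself just another promotion, so history stays append-only and nothing is ever overwritten.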
You can steer it: evoagents autofix --guide "prefer primary sources"
Install: pip install evoagents
Repo: https://github.com/jatingargiitk/evoagents