frontpage.

Hi,I built a small local-first CLI toolkit for debugging AI/agent incidents.

Problem I kept hitting: building agents is fast, but when something breaks, handing off “one failing run” is messy (screenshots, scattered logs, partial configs, access to a tracing UI, accidental secrets/PII in payloads).

What this does: run your agent on a case suite and generate a portable evidence pack you can open offline and attach to a GitHub issue/ticket:

report.html (offline viewer)

compare-report.json (machine-readable summary for CI gating: none | require_approval | block)

evidence files referenced via a manifest (so you can verify completeness/integrity)

It’s intentionally self-hosted/local-only: no backend, no accounts, nothing leaves your environment unless you export the pack.

Redaction note: in the “production” pipeline, redaction is applied in the runner before artifacts are written (the agent is not required to support a special header). There’s also a strict mode that scans all manifest-referenced files for residual markers as a safety gate.

I’m not trying to replace tracing/observability tools — this is meant to be the “handoff unit” when sharing a link or granting UI access isn’t viable.

Questions for HN:

If you’ve had to share a single failing run with another engineer/vendor, what was the missing piece that caused the most back-and-forth?

What would you consider “minimum viable contents” vs a “bundle monster”?

US FDA reverses course, will review Moderna's flu vaccine

Seedance AI video demo debunked

Datacenters Behaving Like Acoustic Weapons [video]

Fast-growing trees are taking over the forests of the future

"Final" MJ Rathbun Post

AI-Native Observability Notebook

Journalism schools are teaching fear of the future

Support marine biodiversity research; choose name of a newly discovered species

Tell HN: OpenCode/Claude Code and Playwright CLI is great for front end dev

Show HN: Analytics that tells AI product teams where their AI fails user

Will I Be Paid in Tokens?

RustyClaw: Open-source multi-agent AI orchestration in Rust

Proton's width measured to unparalleled precision, narrowing path to new physics

Ditching Discord

Electric Vehicle Sales Boom as Ethiopia Bans Fossil-Fuel Car Imports

The West's Winter Has Been a Slow-Moving Catastrophe

Show HN: SalaryScript – The FAANG Negotiation Playbook

GPLv2 and Installation Requirements

Brain file format for AI agents – one file, any LLM, sub-millisecond queries

Google's Lyria 3: make 30-second audio tracks using text or images (in beta)

Spec driven development – new workflows and spec types

Ask HN: Do you build your own X?

Trump has prepared speech on extraterrestrial life, Lara Trump says

Querying OSM objects by their shapes

The History of Sushi

The Worst-Case Future for White-Collar Workers

Do the people building Claude understand what they've created?

Show HN: What We See. An AI generated art exhibition

Model collapse – how LLMs become worse when trained on their own output

Conversations with an AI That Argues Back

Show HN: Local "incident bundle" for AI/agent failures (offline rep and CI JSON)