frontpage.

Orchid (Orchestration interactive debugger) is a zero-instrumentation proxy that captures every API & LLM call in your agent pipeline, then lets you inspect and replay the entire run locally, step by step. No instrumentation, no vendor lock-in, no cloud dependency. It also provides a visual inspector and MCP server, so you can inspect the session yourself or use your favorite agentic coding IDE to debug your agent runs.

I built it because I was tired of debugging agent failures by grepping through logs, and the available AI observability tools all seemed to require intrusive instrumentation and/or sending my prompts and responses to a cloud service. I wanted something that would let me debug agent runs locally, without having to worry about vendor lock-in or data privacy.

Orchid is that tool. The call inspection features work extremely well, at least for my use cases, but the replay feature is perhaps more interesting. It makes LLM pipeline testing deterministic without mocking or re-running expensive API calls.

Free, self-hosted, runs on your machine or infrastructure: https://github.com/mario-guerra/orchid-trace

Would love feedback from anyone building multi-step agentic systems or struggling with non-deterministic LLM test failures.

Drastically Reduce Stress with a Work Shutdown Ritual – Cal Newport

The AI Data Centre Legal Case That Could Eradicate Civil Rights

Why big AI labs are hiring so many philosophers

What does your eval measure?

Show HN: Tuip – CLI / TUI for checking SaaS vendors' statuses

Loops Burn Tokens

Show HN: Gifhub, bug hunter that shows instead of tells

The Bargain. Or what America forgot and Europe still keeps

The Xteink X4 E-Ink Reader

Sentrup – AI Customer Support Platform

Exploiting vulnerabilities in Johnson and Johnson web apps

Show HN: Cutlistor – Instant cut list optimizer with 3D Model and PDF Import

I crawled 827 employers' career sites to measure ATS market share

Germany's Kai Havertz: 'I make runs that look pointless but I'm creating space'

Ask HN: How much coding should beginners learn in the AI era?

Show HN: Empowering codex/Claude Code with Aswath Damodaran valuation thinking

Building a LoFi Radio

Show HN: Metaspec: The DpANS3R Common Lisp Spec in S-Expr and HTML Format

Show HN: Browser based tool for programming ch57x macro-pads

Create cross-platform mobile apps with Ruby

Show HN: (Spotlight/Raycast for Web Search not local) && (compare AI responses)

How to Measure the ROI of FDE

Show HN: LinkedIn Remote jobs by technology and country Map. Joint effort.

Seoul: AWS and Google Cloud Kept Failing the Same Network Path?

Human Dignity – On the Perils of Indifference

Claude Agents in Notion

Fable – Is it ever coming back?

Retracted: Paper claiming immunochemotherapy more effective in morning

Agentic Design Patterns

ModelFit – find the cheapest LLM that can back up your main coding model

Show HN: Orchid – Local-first record and replay for AI agent debugging