The workflow usually ends up being: write some code, run it, tweak a prompt, add logs just to understand what actually happened. It works in some cases, breaks in others, and it’s hard to see why. You also want to know that changing a prompt or model didn’t quietly break everything.
Reticle puts the whole loop in one place.
You define a scenario (prompt + variables + tools), run it against different models, and see exactly what happened: prompts, responses, tool calls, and results. You can then run evals against a dataset to see whether a change to the prompt or model breaks anything.
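To make the "scenario" idea concrete, here's a minimal sketch in TypeScript of what a prompt-plus-variables-plus-tools bundle could look like. This is my own illustration, not Reticle's actual API: the `Scenario` shape and `renderPrompt` helper are hypothetical.

```typescript
// Hypothetical sketch -- not Reticle's actual API.
// A "scenario" bundles a prompt template, its variables, and tool names.
interface Scenario {
  prompt: string;                    // template with {{var}} placeholders
  variables: Record<string, string>; // values substituted at run time
  tools: string[];                   // tools the model is allowed to call
}

// Substitute variables into the prompt template.
function renderPrompt(s: Scenario): string {
  return s.prompt.replace(
    /\{\{(\w+)\}\}/g,
    (_match: string, name: string) => s.variables[name] ?? "",
  );
}

const scenario: Scenario = {
  prompt: "Summarize the ticket: {{ticket}}",
  variables: { ticket: "Login fails on Safari" },
  tools: ["search_docs"],
};

console.log(renderPrompt(scenario));
// "Summarize the ticket: Login fails on Safari"
```

Running the same rendered scenario against several models and diffing the outputs is the core loop.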
There’s also a step-by-step view for agent runs so you can see why the agent made each decision. Everything runs locally. Prompts, API keys, and run history stay on your machine (SQLite).
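For a sense of what a step-by-step agent view has to work with, here's a hedged sketch of the kind of per-step trace record such a view might store and replay. The `TraceStep` shape and `timeline` helper are my assumptions for illustration, not how Reticle actually persists runs.

```typescript
// Hypothetical sketch of a per-step trace record an agent-run view might store.
interface TraceStep {
  step: number; // ordinal position within the run
  kind: "prompt" | "tool_call" | "tool_result" | "response";
  content: string;
  timestamp: string; // ISO 8601, useful for ordering a local run history
}

// Reconstruct a readable timeline from stored steps, ordered by step number.
function timeline(steps: TraceStep[]): string[] {
  return [...steps]
    .sort((a, b) => a.step - b.step)
    .map((s) => `#${s.step} ${s.kind}: ${s.content}`);
}

const run: TraceStep[] = [
  { step: 2, kind: "tool_result", content: "3 docs found", timestamp: "2024-01-01T00:00:02Z" },
  { step: 1, kind: "tool_call", content: 'search_docs("safari login")', timestamp: "2024-01-01T00:00:01Z" },
];

console.log(timeline(run));
```

Storing each step as its own row is what makes "why did it make that decision?" answerable after the fact: you can scrub through tool calls and intermediate results instead of one opaque final answer.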
Stack: Tauri + React + SQLite + Axum + Deno.
Still early and definitely rough around the edges. Is this roughly how people are debugging LLM workflows today, or do you do it differently?
Here’s a quick demo of the agent execution view:
https://raw.githubusercontent.com/fwdai/reticle/main/.github...