Show HN: Saguaro: CLI that makes Claude Code fix its own mistakes

https://github.com/mesa-dot-dev/saguaro
3•Mitchem•1h ago
I've been using Claude Code Max and Codex daily and kept hitting the same problem: AI quickly ships working code with real issues (logic errors, security gaps, subtle regressions). You catch them in review and fix them, but by then the agent session has already closed. Doesn't it make sense to have the AI fix its own mistakes while it still knows why it made them?

Saguaro is a background daemon that reviews AI-generated code and feeds findings back to the same agent that wrote it. Because the agent knows why it made those decisions in the first place, it can evaluate the critique and self-correct what's actually wrong.

The flow: you tell Claude Code to build something. Claude writes code. Saguaro's stop hook triggers a background review (the user sees nothing). On the next turn, findings come back to Claude. Claude says "I see some issues with my approach, fixing now" and corrects itself. No human typed anything. No blocking.

It uses your existing Claude Code / Codex / Gemini subscription. No API key needed. No external account. Everything runs locally. The daemon self-spawns on demand and auto-shuts down after 30 minutes of inactivity.
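
A minimal TypeScript sketch of the idle-shutdown idea, for illustration only. The class and method names are hypothetical and this is not Saguaro's actual code; the only figure taken from the post is the 30-minute limit. The clock is passed in explicitly so the logic is easy to test.

```typescript
// 30 minutes of inactivity, as stated in the post.
const IDLE_LIMIT_MS = 30 * 60 * 1000;

// Hypothetical helper: tracks the last activity time and reports when
// the daemon has been idle long enough to shut itself down.
class IdleTracker {
  private lastActivityMs: number;
  private readonly limitMs: number;

  constructor(nowMs: number, limitMs: number = IDLE_LIMIT_MS) {
    this.lastActivityMs = nowMs;
    this.limitMs = limitMs;
  }

  // Call whenever a review job arrives or finishes.
  touch(nowMs: number): void {
    this.lastActivityMs = nowMs;
  }

  // True once the idle period has reached the limit.
  shouldShutDown(nowMs: number): boolean {
    return nowMs - this.lastActivityMs >= this.limitMs;
  }
}
```

A daemon loop would presumably call `touch` on each job and poll `shouldShutDown` on a timer, exiting when it returns true.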

There's also a rules engine for teams that want more deterministic enforcement. You write rules as markdown files with YAML frontmatter, scoped to specific file globs. But the daemon works out of the box with zero rules. It reviews like a senior staff engineer: bugs, security, regressions, dead code. The rules engine adds more precision for teams/individuals that need it.
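
To make the rules idea concrete, here is a rough TypeScript sketch of parsing a markdown rule with frontmatter and checking which files it applies to. The field names (`id`, `globs`), the rule text, the comma-separated glob format, and the tiny glob matcher are all assumptions for illustration; Saguaro's real rule schema and matching may differ.

```typescript
// Hypothetical rule file; the `id` and `globs` field names are assumptions.
const ruleFile = `---
id: no-raw-sql
globs: src/api/**/*.ts, src/db/*.ts
---
Use the query builder instead of string-concatenated SQL.`;

interface Rule {
  id: string;
  globs: string[];
  body: string;
}

// Parses a tiny subset of YAML frontmatter: flat `key: value` lines only.
function parseRule(text: string): Rule {
  const match = /^---\n([\s\S]*?)\n---\n?([\s\S]*)$/.exec(text);
  if (!match) throw new Error("missing frontmatter");
  const fields = new Map<string, string>();
  for (const line of (match[1] ?? "").split("\n")) {
    const idx = line.indexOf(":");
    if (idx > 0) fields.set(line.slice(0, idx).trim(), line.slice(idx + 1).trim());
  }
  return {
    id: fields.get("id") ?? "",
    globs: (fields.get("globs") ?? "")
      .split(",")
      .map((g) => g.trim())
      .filter((g) => g.length > 0),
    body: (match[2] ?? "").trim(),
  };
}

// Converts a glob to a RegExp. Supports only `**/`, `**`, and `*`.
function globToRegExp(glob: string): RegExp {
  let out = "";
  let i = 0;
  while (i < glob.length) {
    if (glob.startsWith("**/", i)) {
      out += "(?:.*/)?"; // zero or more directories
      i += 3;
    } else if (glob.startsWith("**", i)) {
      out += ".*";
      i += 2;
    } else if (glob.charAt(i) === "*") {
      out += "[^/]*"; // anything within one path segment
      i += 1;
    } else {
      // Escape regex metacharacters in literal path characters.
      out += glob.charAt(i).replace(/[.+?^${}()|[\]\\]/g, "\\$&");
      i += 1;
    }
  }
  return new RegExp("^" + out + "$");
}

// A rule applies to a file if any of its globs matches the path.
function ruleApplies(rule: Rule, path: string): boolean {
  return rule.globs.some((g) => globToRegExp(g).test(path));
}
```

With this sketch, `ruleApplies(parseRule(ruleFile), "src/api/users/list.ts")` is true, while a path like `src/ui/button.tsx` falls outside the rule's scope.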

Setup is "sag init" + restart CC + go back to coding. That's it.

Apache-2.0. TypeScript.

Comments

Mitchem•1h ago
Hey HN, author here.

The thing that makes this work is where in the loop the review happens. CodeRabbit, Greptile, etc. review at the PR level, after the agent is done. The findings go to a human who has to interpret them. The agent that wrote the code never sees the critique. We find that most people just spin up a new agent and ask "Are these review findings correct?" anyway.

Saguaro reviews during the agent's session and sends findings back to the same agent. Because the agent still has its full context window and knows why it made each decision, it can evaluate the findings intelligently. "I made this choice for X reason, but this review shows a gap in my thinking, let me fix that." Or "This finding isn't relevant because of Y." The agent has the context to make that judgment call. That's why false positives are lower.

The daemon is completely invisible to the user. It self-spawns from the Claude Code stop hook, runs a SQLite-backed job queue on localhost, and auto-shuts down after 30 minutes idle. The review happens in the background while the user keeps working. We feed context from the original programming session into the review process. The findings surface on the next stop hook, and your agent just starts fixing things.
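
To make the timing concrete, here is a minimal in-memory TypeScript sketch of that loop: each stop hook enqueues a review and hands back whatever findings have finished since the previous hook. The real daemon uses a SQLite-backed queue; the class, method names, and stub reviewer below are hypothetical stand-ins, not Saguaro's implementation.

```typescript
type JobStatus = "queued" | "done" | "delivered";

interface ReviewJob {
  id: number;
  diff: string;
  status: JobStatus;
  findings: string[];
}

// In-memory stand-in for the job queue (an assumption; the post says SQLite).
class ReviewQueue {
  private jobs: ReviewJob[] = [];
  private nextId = 1;

  // Called by the stop hook: collect finished findings, then enqueue a new review.
  onStopHook(diff: string): string[] {
    const ready = this.collectFinished();
    this.jobs.push({ id: this.nextId++, diff, status: "queued", findings: [] });
    return ready;
  }

  // Called by the background worker; `review` is a stub for the model call.
  runPending(review: (diff: string) => string[]): void {
    for (const job of this.jobs) {
      if (job.status === "queued") {
        job.findings = review(job.diff);
        job.status = "done";
      }
    }
  }

  // Findings are handed out exactly once, then marked delivered.
  private collectFinished(): string[] {
    const out: string[] = [];
    for (const job of this.jobs) {
      if (job.status === "done") {
        out.push(...job.findings);
        job.status = "delivered";
      }
    }
    return out;
  }
}

// Stub reviewer: flags a hypothetical issue when a diff contains "eval(".
const stubReview = (diff: string): string[] =>
  diff.includes("eval(") ? ["avoid eval on user input"] : [];

const queue = new ReviewQueue();
const first = queue.onStopHook("const x = eval(input)"); // nothing reviewed yet
queue.runPending(stubReview); // background review finishes
const second = queue.onStopHook("const y = 1"); // findings from turn 1 arrive
```

Here `first` is empty and `second` carries the finding from the first turn. If a stop hook fires before the background review has finished, it returns nothing and the findings arrive one hook later.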

For teams that want more precision, there's a rules engine: markdown files with YAML frontmatter that enforce specific patterns (architectural boundaries, security invariants, etc). But the daemon works with zero rules out of the box. The rules engine works great for teams with well-defined rules.

Some technical decisions:
- SQLite (via better-sqlite3) as the job queue: the right amount of infrastructure for a local dev tool.
- The daemon reviewer gets the original agent's summary ("the developer described their work as...") for context.
- The review agent gets read-only tools (Read, Glob, Grep) with up to 15 tool calls per review, so it can inspect the full codebase for context but can't edit.
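
The read-only tool limit above could be enforced with something like the following TypeScript sketch. The `ToolBudget` class is hypothetical; only the tool names and the 15-call cap come from the post.

```typescript
// Tool names and the cap of 15 are from the post; everything else is assumed.
const READ_ONLY_TOOLS = new Set<string>(["Read", "Glob", "Grep"]);
const MAX_TOOL_CALLS = 15;

// Hypothetical gate that the reviewer's tool dispatcher would consult.
class ToolBudget {
  private used = 0;
  private readonly limit: number;

  constructor(limit: number = MAX_TOOL_CALLS) {
    this.limit = limit;
  }

  // Returns true if the call may proceed. Rejects tools outside the
  // read-only set and any call past the budget; rejected calls are not counted.
  tryCall(tool: string): boolean {
    if (!READ_ONLY_TOOLS.has(tool)) return false;
    if (this.used >= this.limit) return false;
    this.used += 1;
    return true;
  }

  get remaining(): number {
    return this.limit - this.used;
  }
}
```

Checking the allowlist before the budget means an attempted write (say, a hypothetical `Edit` tool) is refused without using up one of the review's 15 calls.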

Limitations:
- The daemon review is async. Findings arrive on the next stop hook, not the current one. Fast iterations may miss a cycle.
- Review quality depends on the model. We default to your configured model, but you can override it for the daemon specifically.
- Cost is your normal AI provider usage. `sag stats` tracks it.

Happy to answer technical questions about the architecture.

A7OM•1h ago
Nice work! Claude Code users will love knowing they can also add our MCP server to get live inference pricing directly in their workflow. Useful for cost-aware agent development. a7om.com/mcp

prallo•1h ago
Co-creator of Saguaro here. Just wanted to add that you can configure Saguaro to run just a rules review, just a daemon background review, or both after every pass your coding agent makes. We've seen the rules review complete in 1 to 5 seconds on average.
