ai-runtime-guard is an MCP server that sits between your AI agent and your system. It enforces a policy layer before any file or shell action takes effect. No retraining, no prompt engineering, no changes to your agent or workflow.
Your agent can say anything. It can only do what policy allows.
What it does:
- Blocks dangerous commands (`rm -rf`, `dd`, shutdown, privilege escalation) before execution
- Gates risky commands behind human approval via a web GUI
- Simulates the blast radius of wildcard operations before they run
- Creates automatic backups before destructive actions
- Keeps a full audit trail of everything the agent does
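To give a feel for the deny / approve / allow split, here's a minimal sketch of a command classifier. The patterns, tier names, and function are illustrative assumptions on my part, not ai-runtime-guard's actual implementation:

```python
import re

# Illustrative policy tiers -- not the project's real rule set.
DENY_PATTERNS = [
    r"\brm\s+-rf\b",   # recursive force delete
    r"\bdd\b",         # raw disk writes
    r"\bshutdown\b",   # host shutdown
    r"\bsudo\b",       # privilege escalation
]
APPROVE_PATTERNS = [
    r"\bgit\s+push\s+--force\b",  # risky but sometimes legitimate
]

def verdict(command: str) -> str:
    """Return 'deny', 'approve' (needs human sign-off), or 'allow'."""
    for pat in DENY_PATTERNS:
        if re.search(pat, command):
            return "deny"
    for pat in APPROVE_PATTERNS:
        if re.search(pat, command):
            return "approve"
    return "allow"

print(verdict("rm -rf /tmp/build"))  # deny
print(verdict("git push --force"))   # approve
print(verdict("ls -la"))             # allow
```

The key design point is that this check runs in the MCP server, outside the model's control, so a jailbroken prompt can't talk its way past it.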
Works with Claude Desktop, Cursor, Codex, and any stdio MCP-compatible client. The default profile gives basic protection out of the box; advanced tiers are opt-in.
Validated on macOS Apple Silicon. Linux is expected to work; formal validation is coming in v1.1.
Would love feedback from anyone running AI agents with filesystem access.
entrustai•57m ago
The failure modes are different. An agent that deletes the wrong file causes immediate visible damage. An agent that outputs a guaranteed return, a clinical claim it can't support, or a sycophantic opener in a regulated context causes liability that surfaces weeks later in a compliance review.
The audit trail approach you've taken, with HMAC on approvals, is the right instinct for the action layer. The same logic applies to the output layer: you need to prove not just what was blocked, but that the check ran at all, against a specific versioned policy, at a specific time.
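To make that concrete, here's a rough sketch of a signed audit record. The schema, key handling, and function names are my assumptions for illustration, not the project's actual API:

```python
import hashlib
import hmac
import json
import time

# In practice the key would come from a KMS or environment, not source.
SECRET = b"audit-signing-key"

def signed_audit_record(action: str, decision: str, policy_version: str) -> dict:
    """Build an audit record and attach an HMAC over its canonical JSON form."""
    record = {
        "action": action,
        "decision": decision,
        "policy_version": policy_version,
        "ts": time.time(),
    }
    payload = json.dumps(record, sort_keys=True).encode()
    record["hmac"] = hmac.new(SECRET, payload, hashlib.sha256).hexdigest()
    return record

def verify(record: dict) -> bool:
    """Recompute the HMAC over the record minus its signature and compare."""
    claimed = record.pop("hmac")
    payload = json.dumps(record, sort_keys=True).encode()
    expected = hmac.new(SECRET, payload, hashlib.sha256).hexdigest()
    record["hmac"] = claimed
    return hmac.compare_digest(claimed, expected)
```

Because the policy version and timestamp are inside the signed payload, a later compliance review can show which rules were in force when the decision was made, not just that some check existed.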
Good work on the blast radius simulation — that's the kind of deterministic pre-flight check that makes governance defensible.
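For readers unfamiliar with the idea: the core of a wildcard pre-flight is just expanding the pattern without executing anything and reporting what would be touched. A rough sketch, with names that are mine rather than the project's:

```python
import glob

def blast_radius(pattern: str, limit: int = 20) -> list[str]:
    """Dry-run a wildcard: list the paths it would hit, without touching them.

    The limit keeps the approval UI readable when a pattern matches
    thousands of files.
    """
    matches = glob.glob(pattern, recursive=True)
    return sorted(matches)[:limit]
```

Because the expansion is deterministic for a given filesystem state, the same preview shown to the human approver is the one recorded in the audit trail.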