Show HN: Railyard – open and secure runtime for Claude Code

3•LunarFrost88•1h ago

We're a small startup (but have ~15 years of experience building software), so we try to run Claude Code as autonomously as possible. The goal is to spend most of our time talking to customers instead of babysitting the agent. But --dangerously-skip-permissions felt a bit too wild west for us.

So we built a runtime to make autonomous use safer. Railyard is an open-source runtime that sits between Claude Code and the shell and adds guardrails to agent commands.

Every command Claude runs goes through Railyard first. Most commands pass straight through. The ones that could cause damage (for example terraform destroy) get blocked or require approval.

Under the hood it runs commands inside an OS-level sandbox (sandbox-exec on macOS and bwrap on Linux) and applies deterministic rules. There’s no LLM scoring commands or guessing about intent — a command either matches a rule or it doesn’t. The check takes about 2ms.

By default it blocks destructive commands like terraform destroy or rm -rf, prevents access to sensitive paths like ~/.ssh, ~/.aws, and /etc, restricts certain network calls, and catches simple evasion tricks like base64, hex, or variable obfuscation.

It also snapshots file writes so you can roll back a session if something goes wrong.

In practice this lets us run Claude Code with --dangerously-skip-permissions, but with guardrails underneath so we can move fast without breaking or deleting production assets.

We built this because we wanted Claude Code to behave more like a software factory. Factories run at high volume, but only because the production line has quality and safety checks. Railyard is the guardrail layer that makes that possible for us.

Repo: https://github.com/railyarddev/railyard

It's MIT licensed and free to use. If you're experimenting with autonomous agents, feel free to clone it and try it out. I'm especially curious how people push or break these guardrails.

Happy to answer any questions about how it works.

Comments

joaquin_arias•1h ago

This looks really useful! I like how you added OS-level sandboxing and deterministic guardrails instead of relying on LLM-based intent checks — that feels much safer for running autonomous agents.

Curious: have you tried integrating this with multi-agent setups, where multiple Claude Code instances interact? I wonder how the guardrails would scale when agents start triggering each other’s commands.

Also, do you have plans for a lightweight visualization dashboard for monitoring blocked vs allowed commands in real time? It could help developers trust the system more quickly.

LunarFrost88•27m ago

Thanks for the feedback. Love the point about the visualization dashboard, will add that now!

>> have you tried integrating this with multi-agent setups, where multiple Claude Code instances interact?

We wanted to solve for the most frequent use case first (single-agent execution), but multi-agent is definitely on the cards. If you've got some use cases in mind, let me know and we'll apply Railyard to it.

simosmik•20m ago

That’s nice work guys. Knowing anthropic, their auto-mode which releases on the 12th is going to leave a lot to be desired

Nvidia's Groq Plot Thickens – The Chip Letter

The Latest Republican Efforts to Make It Harder to Vote in the Midterms

The Dark Factory Is a .dot file

Uber uses AI for development: inside look

Iowa Payphone Defends Itself (Associated Press, 1984)

Show HN: Quick Look Source Code in Finder on macOS

Against Vibes: When Is a Generative Model Useful

Show HN: KaraMagic – automatic karaoke video maker

What comes after agents? AI employees

Photocopier No More: The Reckoning with AI Creativity Has Arrived

Inverse Occam's Razor

Tell HN: Apple development certificate server seems down?

Mother of All Grease Fires

6-Axis Milling for Enhancing Quality of Fused Granular Fabrication Parts

Working to Decentralize FedCM

Agent-sync – sync between Claude Code and Codex configs

Helix 02 living room tidy

Don't let LLMs write for you

Deep Learning: Our Year 1990-1991

Ask HN: I built an AI-native codebase framework–could you evaluate it?

The Slowest Viral Thing

SoftBank eyes up to $40B loan to fund OpenAI investment

SEIA Solar Market Insight Report 2025 Year in Review

A vertical tab companion app for aerospace window manager

Uber rolls out women-only option in the US

Meta Is Buying Moltbook

GoT Timeline – a daily timeline game to test your Game of Thrones skills

Claude Code makes local LLMs 90% slower

Eventbrite Enters into Definitive Agreement to Be Acquired by Bending Spoons

Why doesn't V8 fit on my microcontroller? (2021)