Are we defaulting to VM-level sandboxing before understanding the threat model?

2•samhita-alla•1h ago

Hey everyone, I'm Samhita and I work at Union.ai. We've been building infrastructure for running agents and building models, which naturally got us thinking a lot about sandboxing. One thing I've been wondering: are we overusing heavyweight sandboxing solutions?

I think some form of isolation is non-negotiable when you're running model-generated code. Things like process isolation, filesystem restrictions, network controls etc. make complete sense. What I'm less sure about is whether VM-based approaches are necessary as often as people seem to think.

In my experience using coding agents locally, basic guardrails and sensible restrictions have been enough most of the time, at least when I'm operating in a relatively controlled environment and not deliberately pushing the agent into risky situations. Of course, that's very different from a production service running arbitrary user code.

So I'm curious:

- What are you using to sandbox agents today?

- What threat model are you optimizing for?

- Have you had incidents that convinced you VM-level isolation was necessary?

- Where do you draw the line between "good enough" and "needs stronger isolation"?

Would love to hear what has worked (or not worked) for others.

Comments

cpburns2009•49m ago

This is what I've been working on for the past month: a strategy for running sandboxed containers. It's not strictly for agents (though I'm now using it for OpenCode). You should be thinking about supply chain attacks for all of your applications that use third-party dependencies. PyPI and NPM have had a lot of compromised packages recently. The litellm hack affected a lot of agents, and there have been some Docker escape exploits.

The threat model I'm concerned about is supply chain attacks on third-party package repositories. The primary goal is to keep the convenience of containers, but to limit the blast radius of a compromised package or application, and reduce the risk of container escape. The software stack I'm currently evaluating is:

- Kata Containers: Backend for containerd to run each container in a KVM-backed QEMU microVM (alternative to the standard runc).

- containerd: Container runtime. Docker and Podman are not compatible with Kata 3. Kata 4 is supposed to fix that.

- nerdctl: Docker-compatible front-end to containerd.

- cni-plugins: Networking component for containerd. Used to isolate container networks.

- iron-proxy: MitM, TLS-intercepting egress proxy. This restricts all outbound traffic to whitelisted domains and IPs, and supports secret injection. Squid is a more established alternative.
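To make the egress restriction concrete, here is a minimal Python sketch of an allowlist check of the kind such a proxy might apply. It is not iron-proxy's or Squid's actual logic. The function name `is_egress_allowed`, its parameters, and the choice to let subdomains of an allowed domain through are all assumptions made for illustration.

```python
import ipaddress


def is_egress_allowed(host, allowed_domains, allowed_networks):
    """Return True if host (a domain name or IP literal) is on the allowlist.

    Hypothetical helper: the name, signature, and matching rules are
    illustrative assumptions, not any real proxy's behaviour.
    """
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        addr = None  # not an IP literal, treat it as a domain name

    if addr is not None:
        # IP literals must fall inside one of the allowed CIDR ranges.
        return any(addr in ipaddress.ip_network(net) for net in allowed_networks)

    name = host.lower().rstrip(".")
    for domain in allowed_domains:
        domain = domain.lower()
        # Exact match, or a subdomain of an allowed domain (assumed policy).
        if name == domain or name.endswith("." + domain):
            return True
    return False
```

Matching on `"." + domain` rather than a bare suffix is what keeps a lookalike such as `evilpypi.org` from passing as `pypi.org`. Anything not explicitly listed is denied.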

How is this used in practice? I have a small bash script to launch the sandboxed OpenCode container with the current folder bind-mounted. OpenCode only has file-system access to the context directory, and limited network/internet access.
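My real launcher is a bash script, but roughly the same idea can be sketched in Python as a function that assembles the `nerdctl run` command. Treat everything specific here as an assumption: the image name, the network name `sandbox-net`, the proxy address, the `/workspace` mount point, and pointing the container at the proxy through environment variables are placeholders, not my actual configuration.

```python
import os


def build_launch_command(image, workdir=None, network="sandbox-net",
                         proxy_url="http://10.4.0.1:8080"):
    """Build a nerdctl command that runs `image` in a Kata microVM.

    Hypothetical helper: default network name, proxy URL, and mount
    point are illustrative placeholders.
    """
    workdir = os.path.abspath(workdir or os.getcwd())
    return [
        "nerdctl", "run", "--rm", "-it",
        # Use the Kata shim instead of the default runc runtime.
        "--runtime", "io.containerd.kata.v2",
        # Attach to an isolated CNI network (assumed to exist already).
        "--network", network,
        # Only the context directory is visible inside the container.
        "-v", f"{workdir}:/workspace",
        "-w", "/workspace",
        # Send outbound traffic through the egress proxy (assumed wiring).
        "-e", f"HTTPS_PROXY={proxy_url}",
        "-e", f"HTTP_PROXY={proxy_url}",
        image,
    ]
```

The returned list can be handed to `subprocess.run(cmd)` from the project directory. Building it as a list rather than a shell string avoids quoting problems with paths that contain spaces.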
