The agent itself is the easy part. The hard part is everything around it: where does it execute safely? What happens when it fails midway through a workflow? How do you trigger it from your existing tools? How do you even know what it did?
I kept stitching together Docker, a workflow engine, a notification layer, and custom retry logic. Every team I talked to was doing the same thing. So I built Polos - an open-source runtime that handles the production layer so you just write the agent.
What it does:
- Sandboxed execution: agents run sensitive operations inside managed Docker containers with built-in tools for file I/O, bash, and web search. You don't manage the sandbox or its lifecycle; Polos does. Support for more sandbox backends like E2B is planned.
- Slack integration: @mention an agent in Slack, get responses in thread. Trigger workflows from Slack, receive notifications, collect input. Agents become part of your team's existing workflow.
- Durable workflows: if an agent fails mid-run, it resumes from the exact step that failed. Built-in prompt caching with 60-80% cost savings on retries.
- Observability: OpenTelemetry tracing for every step, tool call, and decision.
- LLM agnostic: works with OpenAI, Anthropic, Google, or any provider via Vercel AI SDK and LiteLLM.
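The durable-workflow idea can be sketched conceptually. This is not Polos's API or implementation, just a minimal stdlib-Python illustration of step-level checkpointing under one assumption: a workflow is a linear sequence of named steps whose results are persisted as each step completes, so a retry skips anything already recorded and resumes at the first step without a result.

```python
# Conceptual sketch only (hypothetical names, not the Polos SDK):
# checkpoint each step's result; on retry, skip completed steps and
# resume at the first step that has no recorded result.

def run_workflow(steps, checkpoint):
    """steps: list of (name, fn); checkpoint: dict persisted between runs."""
    results = {}
    for name, fn in steps:
        if name in checkpoint:            # already completed on a prior run
            results[name] = checkpoint[name]
            continue
        results[name] = fn(results)       # may raise; checkpoint survives
        checkpoint[name] = results[name]  # persist before moving on
    return results

# First run: "deploy" fails, but "fetch" and "build" are checkpointed.
checkpoint = {}
attempts = {"deploy": 0}

def deploy(results):
    attempts["deploy"] += 1
    if attempts["deploy"] == 1:
        raise RuntimeError("transient failure")
    return f"deployed {results['build']}"

steps = [
    ("fetch", lambda r: "source"),
    ("build", lambda r: f"artifact({r['fetch']})"),
    ("deploy", deploy),
]

try:
    run_workflow(steps, checkpoint)       # fails at "deploy"
except RuntimeError:
    pass

# Retry resumes from "deploy"; "fetch" and "build" are not re-executed.
out = run_workflow(steps, checkpoint)
```

Because resumption only needs an ordered log of completed steps, no DAG declaration is required up front; this is also why retries are cheap to combine with prompt caching, since earlier steps (and their prompts) are never replayed.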
The stack is a Rust orchestrator (Axum + Tokio + PostgreSQL), Python and TypeScript SDKs, and a Vite-based UI. You can install and run a durable, sandboxed agent in under 5 minutes:
```shell
curl -fsSL https://install.polos.dev/install.sh | bash
npx create-polos
cd my-project && polos dev
```
Here's a 3-min demo of a coding agent that picks up a GitHub issue, fixes the code in a sandbox, and submits a PR: https://www.youtube.com/watch?v=KYVBpdZ_5eM
Happy to discuss the technical decisions: why Rust for the orchestrator, how durable execution works without a DAG, and the sandbox lifecycle model.