Show HN: Aether – Background agents that fix bugs in isolated VMs and open PRs

6•pranav9•2h ago

Hey HN,

I've been building Aether, a background agent that takes production errors from Sentry and attempts to turn them into verified pull requests.

When a new error hits your Sentry project:

1. The Sentry webhook fires with the stack trace, breadcrumbs, and context.
2. Aether spins up an isolated Fly.io VM and clones the repo at the relevant commit.
3. The agent analyzes the stack trace, reproduces the issue, and proposes a fix.
4. It starts the dev server, re-runs tests, and can verify the running app with Playwright (headless Chromium is pre-installed in every VM).
5. A review pass evaluates the diff before a PR is opened.
6. It pushes to a feature branch and opens a GitHub PR, but only if verification succeeds.
7. If CI fails, it retries once with the failure logs. If it fails again, the task is marked failed. No infinite loops.
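To make step 7 concrete, here is a minimal sketch of a capped retry loop. It is not Aether's actual code: `CIResult`, `run_with_single_retry`, `propose_fix`, and `run_ci` are hypothetical names chosen for illustration, and the only behavior taken from the post is "retry once with the failure logs, then mark the task failed".

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class CIResult:
    """Hypothetical CI outcome: pass/fail plus the logs of a failing run."""
    passed: bool
    logs: str = ""


def run_with_single_retry(
    propose_fix: Callable[[Optional[str]], str],
    run_ci: Callable[[str], CIResult],
    max_retries: int = 1,
) -> Tuple[str, int]:
    """Return (status, attempts); status is "succeeded" or "failed".

    propose_fix receives None on the first attempt and the previous
    failure logs on a retry. The loop is bounded by max_retries, so it
    cannot run forever.
    """
    failure_logs: Optional[str] = None
    attempts = 0
    for _ in range(max_retries + 1):
        attempts += 1
        patch = propose_fix(failure_logs)
        result = run_ci(patch)
        if result.passed:
            return "succeeded", attempts
        # Carry the failure logs into the next (and last) attempt.
        failure_logs = result.logs
    return "failed", attempts
```

With the default `max_retries=1`, a task gets at most two CI runs: the original attempt and one retry informed by the logs.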

Why full VMs instead of worktrees? Each task runs in its own isolated machine with a real filesystem, real process model, real network stack. It can `npm install`, run a dev server on port 3000, and Playwright can hit `localhost:3000` because it's an actual environment, not a sandbox. Since each task is its own VM, preview URLs are exposed per task via a gateway proxy so you can inspect the running app while the agent works. VMs shut down shortly after the task completes.

There's a simple multi-agent setup: a solver proposes the fix, a review agent evaluates the diff, and the fix has to survive re-execution in a clean isolated environment before a PR gets opened. Not claiming formal guarantees here, just requiring the fix to actually execute successfully in a reproducible environment before it touches your repo.
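As a rough illustration of that gate, the sketch below treats the review verdict and the clean re-execution as two independent checks that must both pass before a PR is opened. The names (`Verification`, `pr_blockers`, `should_open_pr`) are assumptions made for this example and do not come from Aether.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Verification:
    """Hypothetical summary of the checks run on a proposed fix."""
    review_approved: bool      # the review agent accepted the diff
    clean_rerun_passed: bool   # the fix re-executed successfully in a fresh VM


def pr_blockers(v: Verification) -> List[str]:
    """List the reasons a PR must not be opened; empty means go ahead."""
    blockers = []
    if not v.review_approved:
        blockers.append("review agent rejected the diff")
    if not v.clean_rerun_passed:
        blockers.append("fix did not pass re-execution in a clean environment")
    return blockers


def should_open_pr(v: Verification) -> bool:
    """A PR is opened only when every check has passed."""
    return not pr_blockers(v)
```

Returning the list of blockers rather than a bare boolean keeps the reason for a rejected fix available for the task log.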

Limitations:

- Works best on well-tested codebases where "reproduce and verify" is meaningful
- If reproduction isn't deterministic, results degrade
- CI retry is capped at one automatic attempt
- Code review is model-driven, not an architectural enforcement layer
- BYOK only: you bring your own API key via OpenRouter. No markup on model costs, but it's not super cheap to run
- Sentry integration is built but waiting on approval from Sentry, coming soon
- CLI is also coming soon

Bug fixing is the main focus but it's built on top of a general-purpose background agents system that works today. The agent is still great at general coding tasks. You can give the agent tasks from a full web IDE with a code editor, terminal, file tree, and agent chat panel. CLI is coming soon too (`aether run "add auth to the API"`). Each task gets its own isolated VM with shareable preview URLs so you can hand someone a link to see exactly what the agent built. Similar to Cursor background agents but running in the cloud with full environment isolation instead of local worktrees.

Stack: Go API (Chi), Fly.io VMs, React 19 + Vite frontend, Bun workspace service inside each VM, Supabase for auth/db/realtime, Playwright + Chromium preinstalled on each VM.

Self-serve right now: GitHub OAuth, connect a repo, and go via the web IDE. Sentry and CLI coming soon.

Would value feedback from engineers who deal with production debugging regularly, or frequently use background agents. Where would this break, and what would make you trust it?

Landing page: https://www.runaether.dev
Try it: https://app.runaether.dev

Comments

JuliaHammel•1h ago

Exciting! What models does it use?

pranav9•1h ago

The main agent currently runs on Opus 4.6, with GPT 5.2 as the review agent (GPT 5.2 is great at reviewing code). A lot of other models are also used by subagents for various small tasks.

corimero•1h ago

How expensive is a run usually?
