frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Aether – Background agents that fix bugs in isolated VMs, opens PRs

6•pranav9•2h ago
Hey HN,

I've been building Aether, a background agent that takes production errors from Sentry and attempts to turn them into verified pull requests.

When a new error hits your Sentry project:

1. Sentry webhook fires with the stack trace, breadcrumbs, and context 2. Aether spins up an isolated Fly.io VM and clones the repo at the relevant commit 3. Agent analyzes the stack trace, reproduces the issue, proposes a fix 4. Starts the dev server, re-runs tests, and can verify the running app with Playwright (headless Chromium is pre-installed in every VM) 5. A review pass evaluates the diff before a PR is opened 6. Pushes to a feature branch and opens a GitHub PR, but only if verification succeeds 7. If CI fails, it retries once with the failure logs. If it fails again, the task is marked failed. No infinite loops.

Why full VMs instead of worktrees? Each task runs in its own isolated machine with a real filesystem, real process model, real network stack. It can `npm install`, run a dev server on port 3000, and Playwright can hit `localhost:3000` because it's an actual environment, not a sandbox. Since each task is its own VM, preview URLs are exposed per task via a gateway proxy so you can inspect the running app while the agent works. VMs shut down shortly after the task completes.

There's a simple multi-agent setup: a solver proposes the fix, a review agent evaluates the diff, and the fix has to survive re-execution in a clean isolated environment before a PR gets opened. Not claiming formal guarantees here, just requiring the fix to actually execute successfully in a reproducible environment before it touches your repo.

Limitations:

- Works best on well-tested codebases where "reproduce and verify" is meaningful - If reproduction isn't deterministic, results degrade - CI retry is capped at one automatic attempt - Code review is model-driven, not an architectural enforcement layer - BYOK only, you bring your own API key via OpenRouter. No markup on model costs but it's not super cheap to run - Sentry integration is built but waiting on approval from Sentry, coming soon - CLI is also coming soon

Bug fixing is the main focus but it's built on top of a general-purpose background agents system that works today. The agent is still great at general coding tasks. You can give the agent tasks from a full web IDE with a code editor, terminal, file tree, and agent chat panel. CLI is coming soon too (`aether run "add auth to the API"`). Each task gets its own isolated VM with shareable preview URLs so you can hand someone a link to see exactly what the agent built. Similar to Cursor background agents but running in the cloud with full environment isolation instead of local worktrees.

Stack: Go API (Chi), Fly.io VMs, React 19 + Vite frontend, Bun workspace service inside each VM, Supabase for auth/db/realtime, Playwright + Chromium preinstalled on each VM.

Self-serve right now: GitHub OAuth, connect a repo, and go via the web IDE. Sentry and CLI coming soon.

Would value feedback from engineers who deal with production debugging regularly, or frequently use background agents. Where would this break, and what would make you trust it?

Landing page: https://www.runaether.dev Try it: https://app.runaether.dev

Comments

JuliaHammel•1h ago
Exciting! What models does it use?
pranav9•1h ago
Main agent is currently Opus 4.6, and GPT 5.2 as the review agent (GPT 5.2 is great at reviewing code). There are also a lot of other models used for subagents for various small tasks.
corimero•1h ago
How expensive is a run usually?

Perfect heat rectification and circulation with nonreciprocal radiative surfaces

https://journals.aps.org/prresearch/abstract/10.1103/7p58-n6yv
1•PaulHoule•3m ago•0 comments

Rebuilding the American Dream, One Row House at a Time

https://www.thefp.com/p/rebuilding-the-american-dream-one
1•mhb•4m ago•0 comments

Why Traditional DLP Fails in the Age of LLMs

https://chris-s-lambert.com/why-traditional-dlp-fails-llms.html
1•clambert4•4m ago•0 comments

There Are Fewer Excuses

https://ctts.substack.com/p/there-are-fewer-excuses
1•threefiftyone96•7m ago•0 comments

NASA’s Artist’s Cooperation Program

https://airandspace.si.edu/explore/stories/nasa-art-program
1•animal_spirits•7m ago•0 comments

Ask HN: What is the future of open source?

1•andreybaskov•12m ago•0 comments

RaidenFTPD

https://www.raidenftpd.com/en/
1•metadat•12m ago•0 comments

Skills Manager

https://github.com/razbakov/skills-manager
1•razbakov•13m ago•0 comments

Lords of the Ring: The cultural politics of sumo wrestling

https://harpers.org/archive/2026/03/lords-of-the-ring-joshua-hunt-cultural-politics-sumo-wrestling/
1•zdw•15m ago•0 comments

Show HN: URL-Based Geospatial Processing

https://geoblob.com/
1•roniemartinez•17m ago•1 comments

Show HN: Free web search for AI agents via MCP (You.com)

https://docs.you.com/developer-resources/mcp-server
1•bsparker•21m ago•0 comments

Fighting Cognitive Debt in Agentic Code with Video Overviews

https://enigmeta.com/posts/2026-02-19-video-overviews-for-agentic-code/
1•fdb•22m ago•1 comments

Opencode Commit: "Anthropic legal requests"

https://github.com/anomalyco/opencode/commit/973715f
2•isehgal•24m ago•1 comments

An ARM Homelab Server, or a Minisforum MS-R1 Review

https://sour.coffee/2026/02/20/an-arm-homelab-server-or-a-minisforum-ms-r1-review/
3•neelc•30m ago•3 comments

RNA comes close to copying itself (with only 45-nucleotides)

https://www.science.org/doi/10.1126/science.aeg3559
4•marojejian•31m ago•0 comments

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

https://huggingface.co/blog/openenv-turing
1•gmays•31m ago•0 comments

LLMs create their smallest transformer for 10-digit addition

https://twitter.com/DimitrisPapail/status/2024555561199480918
1•marojejian•33m ago•1 comments

Show HN: I maintain Valkey GLIDE – built a Node queue doing 48k jobs/s

https://github.com/avifenesh/glide-mq
1•anotherCodder•33m ago•0 comments

A Scheme Shell (1994) [pdf]

https://publications.csail.mit.edu/lcs/pubs/pdf/MIT-LCS-TR-635.pdf
1•todsacerdoti•34m ago•0 comments

MuMu Player (NetEase) silently runs 17 reconnaissance commands every 30 minutes

https://gist.github.com/interpiduser5/547d8a7baec436f24b7cce89dd4ae1ea
2•interpidused•34m ago•0 comments

When Models Manipulate Manifolds: The Geometry of a Counting Task [pdf]

https://arxiv.org/abs/2601.04480
1•vinhnx•35m ago•0 comments

Evaluating the Hardest CS Problems in the Age of LLMs

https://frontier-cs.org/blog/evaluation/
1•matt_d•39m ago•0 comments

Show HN: 6cy v0.3.0 – A streaming-first binary archive format

https://github.com/byte271/6cy/releases/tag/v0.3.0
1•yihac1•39m ago•0 comments

Why consult of UK police on biometrics watchdog leaves too many questions

https://www.biometricupdate.com/202602/why-the-latest-consultation-on-uk-police-biometrics-watchd...
1•WaitWaitWha•40m ago•0 comments

The Hawara Labyrinth – Preservation and Recovery Master Plan

https://archaeologicalrescue.org/hawara/
1•FloorEgg•41m ago•0 comments

Février 2026: L'IA s'auto-construit-elle et bouleverse-t-elle déjà vos RH?

https://nadiaaccompagne.substack.com/p/fevrier-2026-lia-sauto-construit
1•NadiaAccompagne•44m ago•0 comments

The Most Concrete Indicator of a Housing Crisis

https://shonczinner.substack.com/p/the-most-concrete-indicator-of-a
2•kjshsh123•45m ago•1 comments

Big Tech Says Generative AI Will Save the Planet. It Doesn't Offer Much Proof

https://www.wired.com/story/big-tech-says-generative-ai-will-save-the-planet-it-doesnt-offer-much...
4•g-b-r•46m ago•1 comments

Dmux: Parallel agents with tmux and worktrees

https://dmux.ai/
2•handfuloflight•46m ago•0 comments

Optimise Your LLM Workflow with the Chief Wiggum Workflow

https://etoxin.net/wiggum/
1•etoxin•48m ago•0 comments