On the volume number. "Several hundred thousand LOC per week sustained, with peaks over a
million" is gross churn (insertions + deletions) across feature code, tests, generated types,
lockfile updates, and migrations. Net new shippable code is a smaller fraction. The point isn't
raw LOC; it's that the pipeline can sustainably operate on a real production codebase at that
throughput, not a toy.
On stacks supported. The pipeline orchestrates whatever your project uses. Migration adapters
cover Rails (Active Record), Alembic, Django, Prisma, Drizzle, golang-migrate, dbmate, flyway,
supabase-cli, ecto, and typeorm; falls back to a configurable shell command for anything else.
Deploy adapters cover Vercel, Fly, Render, and a generic shell adapter. Validate runs whatever
test/lint/typecheck command you configure (npm test, pytest, go test, anything). Monorepo support
auto-detects npm/yarn/pnpm workspaces, Turborepo, and Nx. Review engine adapters cover Claude,
Gemini, Codex, and any OpenAI-compatible endpoint (Groq, Ollama, Together).
Why this vs Devin or Cursor agent mode. Devin is hosted, opaque, per-ACU billed, single-vendor
stack. claude-autopilot runs locally, every phase is an editable skill, you bring your own
provider keys, MIT-licensed. Cursor agent mode is a single-shot in-IDE loop. claude-autopilot
sits one layer higher: spec review, implementation dispatch, validation, PR review, release
workflow, retry-loop progress detection.
Closest cousins. Aider, OpenHands, SWE-agent. We share the local-CLI plus user's-key philosophy
and add the phase pipeline, multi-model role split, risk-tiered review, and the retry-loop
sameness detector (halts the pipeline when retries make no progress instead of burning the retry
budget on attempts going nowhere).
See it work, with numbers:
- DEMO.md walks through one autonomous run, 12 minutes wall clock, $2.20 spend, 5 new tests:
https://github.com/axledbetter/claude-autopilot/blob/master/DEMO.md
- Benchmark: 13/13 production-realistic bugs caught in 38 seconds for $0.21, reproducible:
https://github.com/axledbetter/claude-autopilot#benchmark
Happy to dig into any of it.
axledbetter01•34m ago