Show HN: P0 – Yes, AI can ship complex features into real codebases

https://www.bepurple.ai/
17•arndt•2h ago

Comments

arndt•2h ago
Hey HN - Arndt here, one of the builders of p0. These always provoke some colorful discussion around here, so let me give some background.

We've been building p0 because we kept hitting the same wall: AI coding tools are great at generating code from scratch, but can fall flat when shipping complex features into multi-repo codebases with real architecture, real standards, and real constraints. We'd get impressive results at first glance, then spend hours fixing the output to match our actual patterns.

p0 bundles two things: a Mac desktop app and a purpose-built harness that treats feature development as a structured pipeline, not an open-ended chat.

How it works:

You start with a product spec (markdown) or an idea and end up with a set of PRs for all the repos that were touched.

p0 runs through a 5-phase pipeline in isolated Git worktrees:

1. Import your spec, and/or brainstorm with AI to refine it, grounded in your codebase and standards

2. A specialized agent breaks it into phased tickets with technical contracts (acceptance criteria, architecture prescriptions, dependencies)

3. Engineering agents implement tickets in parallel while you watch a live ticket tree and agent activity in real-time... or grab a coffee

4. QA agents run through verification loops to enforce the contracts

5. Refine and create PRs in your repos

Why not just use Claude Code Plan mode / [name your tool]

We actually use Claude Code under the hood. What makes p0 different from Claude Code CLI / Conductor / etc. is our focus on shipping complex features autonomously, across all your existing repos.

Spec first -- Puts humans in control of as many product and technical details as you can imagine, and we help you create and refine the spec with agents grounded in your codebase.

Contracts and QA loops -- We generate clear acceptance criteria and boundaries for each task. QA loops make sure they were adhered to.

Ticketing -- The architect breaks every feature into phased tickets with dependency ordering. This isn't just a simple plan, it allows us to break complex problems into smaller, context-fitting tickets, and bring them back together into one cohesive implementation.
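As an illustration of phased tickets with dependency ordering, here is a minimal sketch using Python's standard-library `graphlib`. The ticket names and the `plan_phases` helper are made up for the example and say nothing about how p0 stores or schedules tickets internally.

```python
from graphlib import TopologicalSorter


def plan_phases(dependencies: dict[str, set[str]]) -> list[list[str]]:
    """Group tickets into phases; every ticket in a phase has all dependencies done."""
    sorter = TopologicalSorter(dependencies)
    sorter.prepare()  # raises graphlib.CycleError on circular dependencies
    phases = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # these tickets could run in parallel
        phases.append(ready)
        sorter.done(*ready)
    return phases


# Hypothetical tickets: each key maps to the tickets it depends on.
tickets = {
    "api-schema": set(),
    "db-migration": set(),
    "backend-endpoint": {"api-schema", "db-migration"},
    "frontend-form": {"api-schema"},
    "e2e-tests": {"backend-endpoint", "frontend-form"},
}

for number, phase in enumerate(plan_phases(tickets), start=1):
    print(f"Phase {number}: {', '.join(phase)}")
```

With these example tickets, the schema and migration land in the first phase, the endpoint and form in the second, and the end-to-end tests in the third.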

Subagents -- Nothing fancy here, just a set of roles we've fine-tuned for months, so you don't have to start from scratch.

Standards -- Typical coding agent behavior is to get a cursory (haha) understanding by reading code. But that clogs up context quickly, and rarely rises to the level of understanding the architecture. When you first launch p0, we'll help you generate better AI-targeted documentation.

Multi repo -- The whole harness is multi-repo aware. It maintains cross-repo context (imports, API contracts, shared types) and creates coordinated worktrees across all your repos in a single session.
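For readers unfamiliar with worktrees, here is a small sketch of what "coordinated worktrees across repos" could look like with plain `git worktree add`. The `worktree_commands` helper, the `feature/<name>` branch naming, and the directory layout are assumptions for illustration, not p0's actual behavior.

```python
from pathlib import Path


def worktree_commands(repos: list[str], feature: str, root: str) -> list[list[str]]:
    """Build one `git worktree add` command per repo, all on the same feature branch."""
    commands = []
    for repo in repos:
        repo_path = Path(repo)
        # Assumed layout: <root>/<feature>/<repo name>
        target = Path(root) / feature / repo_path.name
        commands.append([
            "git", "-C", str(repo_path),
            "worktree", "add", "-b", f"feature/{feature}", str(target),
        ])
    return commands


# Hypothetical repo paths; nothing is executed here, the commands are only printed.
for command in worktree_commands(["/src/web", "/src/api"], "checkout-v2", "/tmp/worktrees"):
    print(" ".join(command))
```

Each command could be passed to `subprocess.run(command, check=True)` to actually create the worktree; the main checkouts stay untouched because every repo gets its own separate working directory on a new branch.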

Local-first, team features through the cloud:

All code stays on your machine in isolated worktrees. We do sync codebase documentation and workspace setup through our cloud so your teams can share those for convenience. And of course the prompts go to Anthropic's API.

Limitations:

- The spec-driven workflow has a learning curve. If you're used to the "chat away as you go" flow, the structured planning is a new thing to get used to.

- macOS only right now, Linux/Windows are on the roadmap.

- Works best for substantial features -- for small features, you're better off using Claude's plan mode.

- Requires a Claude subscription or API key and works best with high limits / the 20x plan. Everything is finetuned for Claude 4.6 Opus right now. We plan on supporting other providers, but Claude is where the quality bar is.

What we'd love for you to try:

Is the spec-driven workflow helpful in building larger features? Did we miss anything? What integrations matter most? (We support GitHub, GitLab, and Linear right now.)

You can download p0 at https://www.bepurple.ai. Happy to answer any questions about the architecture or approach.

the_tli•2h ago
We became an early user in December and have since adopted it in a brownfield codebase. I'd describe this as Lovable 2.0 or vibe coding 2.0. The p0 workflow allowed us to delegate medium to complex full features to Claude Code while staying in lane with our standards. It allows us to go from idea to fully working prototype and PR within half an hour to an hour, most of the time fully hands off. PRs still need to be reviewed for production. However, p0 allowed us to drastically improve per-engineer velocity and AI code quality, and to iterate faster with working prototypes and refinements.

Compared to Claude Code directly, which we also use heavily, p0 keeps very strong coherence from user story through spec planning, architecting, engineering and QA, across many agents and subagents, breaking the work down into sequential and parallel tasks. With Claude Code alone, this would usually require a lot of hand-holding, or it would stay only partially focused, with the rest lost in the woods. Also, we attempted to replicate some of p0's ideas with homegrown software dev personas and workflows, which fell apart. I think the strong point of p0 is that they really nailed the decomposition and software dev cycle with agents.

Really recommend trying it, and at the very minimum you get to make your codebase agent-ready if you haven't already.

mareko•1h ago
Interesting approach to put a lot of emphasis on the spec.

Does this only work on existing codebases?

arndt•58m ago
No, it also works on new projects. We do recommend you set up "standards" first though: documentation of how you want things to be done, for example how you do auth, how you handle multi-tenancy, how migrations work, whatever applies to your new project. The whole point of spec-driven is to not let AI wing it, but to be very prescriptive.

We include a couple of templates to make that easier - NextJs + Convex + ShadCn/ui, NextJs + Supabase + ShadCn/ui, etc.

HrubyOnRails•1h ago
I first tried p0 about a month ago. What stood out to me was the way the repo onboarding is designed. The harness puts you through a 15-minute Q&A to generate "standards" that it then keeps in sync with the codebase. IMO that is the best implementation of self-generating repo rules I have seen.

arndt•1h ago
Thank you, and also good luck with your HN launch of BrowserBrawl today!

ngs-schlingel•1h ago
Actually really promising! How does it differ from Claude Code? Massive opportunity right here!

arndt•56m ago
Let me just quote my original post :)

> We actually use Claude Code under the hood. What makes p0 different from Claude Code CLI / Conductor / etc. is our focus on shipping complex features autonomously, across all your existing repos.

In practice, this means you spend much more time defining coding standards and writing product and technical specs (our agents help you with both of course, but you bring the brainpower), and then you hit the button and let p0 build even some seriously large features.

franziloew•46m ago
I started using P0 on our legacy codebase a couple of weeks ago and so far have shipped 3 features to production. Very helpful tool!

jasper_platz•20m ago
Compared to Claude Code etc., where do the time savings come from? Fewer PRs that need manual intervention? A higher-quality codebase to maintain later?

coscreen•9m ago
Worked great for me. Think about integrating the /counselors skill so you can fan out complex questions to multiple hq models across Opus, Gemini, and Codex. We're using it to do extensive design/performance/SEO audits that benefit from multiple smart opinions.
