1. Long sessions cause context drift — the AI gradually ignores the original design
2. The AI writes fake tests — empty assertions, mocking the thing being tested
3. No research phase — the AI guesses how a framework works instead of reading the docs
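To make the second failure mode concrete, here is a hypothetical sketch of what "fake tests" tend to look like in practice (the function and test names are illustrative, not from the project):

```python
def parse_config(text: str) -> dict:
    """Illustrative function under test (hypothetical, not from the project)."""
    return dict(line.split("=", 1) for line in text.splitlines() if "=" in line)

# Failure mode A: the empty assertion. This "test" passes even if
# parse_config returns garbage, because nothing about the result is checked.
def test_parse_config_empty_assertion():
    parse_config("key=value")
    assert True

# Failure mode B: mocking the thing being tested. The stub replaces the
# real function, so the assertion only ever verifies the stub.
def test_parse_config_mocked():
    fake_parse = lambda text: {"key": "value"}
    assert fake_parse("key=value") == {"key": "value"}

# A real test exercises the actual implementation and can actually fail.
def test_parse_config_real():
    assert parse_config("key=value") == {"key": "value"}
```

All three tests pass, but only the last one would fail if `parse_config` broke — which is exactly what mutation testing (below) is designed to detect.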
OPC Workflow is my fix: three markdown files you put in your project and trigger as slash commands (`/plan_sprint`, `/sprint`, `/audit`).
The core mechanic is isolated sessions:

- Planning happens in session A, then you close it
- Development happens in session B, then you close it
- Auditing happens in session C with zero knowledge of session B
The audit is the part I'm most proud of. It runs mutation testing — deliberately breaking each core function to verify the tests actually catch it. In my project, it found a module that directly instantiated components, bypassing the agent registry entirely. Security boundaries, tool injection, and the memory system were all silently failing. I had written both the code AND the tests. Confirmation bias is a real problem.
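The idea behind mutation testing can be sketched in a few lines. This is a minimal, hypothetical illustration of the principle (not the workflow's actual implementation): swap in a deliberately broken variant of a function and check that the test suite fails.

```python
# Minimal sketch of mutation testing. All names here are hypothetical.

def add(a, b):
    return a + b

def test_add():
    assert add(2, 3) == 5

def survives_mutation(mutant) -> bool:
    """Run the suite against a mutant; return True if the mutant slips through."""
    global add
    original = add
    add = mutant
    try:
        test_add()
        return True   # test still passed: the mutation was NOT caught (bad)
    except AssertionError:
        return False  # test failed: the mutation was caught (good)
    finally:
        add = original

# Mutation: replace + with -. A healthy test suite should catch this.
caught = not survives_mutation(lambda a, b: a - b)
print("mutation caught:", caught)  # → mutation caught: True
```

A test suite full of empty assertions would let every mutant survive; a 100% capture rate means every deliberate break was detected.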
Real numbers: 7 sprints, 459 tests, 100% mutation capture rate, 1 critical bug found.
It works with Claude Code, Cursor, Kiro, and Antigravity. One-line install for macOS/Linux:

```bash
bash <(curl -sSL https://raw.githubusercontent.com/yoyayoyayoya/opc-workflow/main/install.sh)
```
And for Windows PowerShell:

```powershell
iex (iwr -useb 'https://raw.githubusercontent.com/yoyayoyayoya/opc-workflow/main/install.ps1').Content
```
Open to feedback, especially from people who've found other failure modes with AI coding tools.

https://github.com/yoyayoyayoya/opc-workflow