
Workflow to build context for coding agents

2•wek•1h ago

Here’s the workflow my team and I have found works best with coding agents:

- Plan: Write a plan in markdown. Edit this. Iterate. The plan isn’t a throwaway note. It tracks status as work progresses (draft -> in-development -> in-review -> completed), versions with git alongside the code, and serves as the single source of truth. When the agent later implements, it reads this document. When we review the work, we compare against it.

- Diagram: Have the agent enrich the plan with architecture diagrams and data models. Edit this. Iterate. These artifacts live alongside the plan and the code. When the agent later implements, it doesn’t need us to re-explain the architecture or the schema. It reads them directly.

- Mockup: For anything with a UI, we create mockups before touching code. Most of the time we generate interactive HTML/JavaScript. This replaces the Figma-to-engineering handoff entirely. When the agent implements the UI, it already knows what it should look like. No exporting, no describing screenshots in words, no “make it look like the design.”

- Tests: We have the agent write tests based on the plan, diagrams, and mockup. We review them, add edge cases, and now we have an executable definition of “done.”

- Implement: Now we tell the agent: “Implement the notification system. Run tests after each major change. Keep going until all tests pass.” The agent works iteratively. Implements the database migration from the data model. Runs tests — schema tests pass. Builds the WebSocket server. Implements the frontend. Runs Playwright and catches a CSS issue from the screenshot, fixes it, reruns. Eventually: all green.

- Review: When the agent finishes, we review the work. We click through the changes, see exactly what was added, modified, or removed, and compare it against the plan and mockup.

- Commit

- Update the Plan: After committing, we close the loop. We ask the agent to update the plan: status moves from in-development to completed, acceptance criteria get checked off, and any implementation notes get added. If anything doesn't match, either the plan gets updated or we rework the code.

- Update Docs and Website: The agent updates our documentation and our website, keeping everything in sync and up to date.
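The status lifecycle from the Plan step (draft -> in-development -> in-review -> completed) can be sketched as a small state machine. This is only an illustration, not our actual tooling: `PlanStatus`, `ALLOWED`, and `advance` are hypothetical names, and the step back from in-review to in-development is my assumption for the "we rework the code" case.

```python
from enum import Enum


class PlanStatus(Enum):
    """Statuses named in the Plan step."""
    DRAFT = "draft"
    IN_DEVELOPMENT = "in-development"
    IN_REVIEW = "in-review"
    COMPLETED = "completed"


# Allowed transitions. The forward path follows the post; the move back from
# in-review to in-development is an assumption covering rework after review.
ALLOWED = {
    PlanStatus.DRAFT: {PlanStatus.IN_DEVELOPMENT},
    PlanStatus.IN_DEVELOPMENT: {PlanStatus.IN_REVIEW},
    PlanStatus.IN_REVIEW: {PlanStatus.COMPLETED, PlanStatus.IN_DEVELOPMENT},
    PlanStatus.COMPLETED: set(),
}


def advance(current: PlanStatus, target: PlanStatus) -> PlanStatus:
    """Return the new status, or raise if the move skips a stage."""
    if target not in ALLOWED[current]:
        raise ValueError(
            f"cannot move plan from {current.value} to {target.value}"
        )
    return target
```

A check like this could run in a pre-commit hook so a plan never jumps straight from draft to completed without passing through review.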

What I like about this workflow, and why it works, is that each step produces context that the next step consumes. By the time the agent starts writing code, it has the spec, the architecture diagram, the database schema, the mockup, and the test suite. Once it's done coding, we update everything, which gives us clean context to build on.
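One way to make "each step produces context that the next step consumes" checkable is a small gate that lists which artifacts are still missing before the agent is told to implement. This is a rough sketch under assumptions: the file names and the one-directory-per-feature layout are hypothetical, not something the workflow prescribes.

```python
from pathlib import Path

# Hypothetical layout: one directory per feature. These file names are
# assumptions for illustration; use whatever your repo actually contains.
REQUIRED_ARTIFACTS = {
    "spec": "plan.md",
    "architecture diagram": "architecture.md",
    "database schema": "data-model.md",
    "mockup": "mockup.html",
    "test suite": "tests",
}


def missing_context(feature_dir: Path) -> list[str]:
    """Return the names of artifacts that do not exist yet in feature_dir."""
    return [
        name
        for name, relative_path in REQUIRED_ARTIFACTS.items()
        if not (feature_dir / relative_path).exists()
    ]
```

If `missing_context(Path("features/notifications"))` returns an empty list, the agent has everything from the earlier steps; otherwise the returned names say which step to go back to.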
