frontpage.

Show HN: Typed Natural Language – A better plan mode with workflow for coding

https://github.com/janaraj/tnl

1•janaraj•1h ago

Plan mode in Claude Code / Codex works, for one session. Next session, your agent re-reads source and re-derives the same decisions you already made. TNL (Typed Natural Language) is that same review-before-code discipline, but persistent: a short English contract with a fixed schema (paths, behaviors with MUST/SHOULD/MAY/[semantic], non-goals), proposed by the agent, approved by you, implemented against, saved on disk, and read by every future session.

It's not a new agent or tool, it slots into whatever you already use. npx typed-nl init adds a workflow stanza to your CLAUDE.md / AGENTS.md / GEMINI.md, scaffolds a tnl/ directory, and optionally wires a PreToolUse hook and MCP server. The minimum product is a stanza + a folder. Hooks, MCP, and tnl verify (CI gate for path and test-binding integrity) are optional layers.

We ran a controlled A/B on an existing 16KLOC Python codebase, event-driven triggers, a 35-scenario behavioural matrix, deliberately ambiguous prompt. Both Baseline and TNL conditions got the same coding discipline in their instruction file; Same agent, same model, same base commit.

Results:

  Agent                   TNL     Baseline   Gap
  Claude Opus 4.7 (R1)    35/35   29/35      +6
  Claude Opus 4.7 (R2)    31/35   27/35      +4
  Claude Opus 4.7 (R3)    30/35   25/35      +5
  Codex GPT-5.4 (R1)      32/35   26/35      +6
  Codex GPT-5.4 (R2)      31/35   26/35      +5

No overlap: TNL's lowest paired cell is 86%, baseline's highest is 83%.

Other signals:

Follow-up work: on round-2 tasks in the same worktrees, TNL agents edited the existing contract (4/4 samples); baseline re-read source. Caveats: small n, LLM sessions are noisy, and we built the tool. Every script, prompt, raw JSON, and session transcript is committed.

We dogfooded it, every feature of the tool itself has its own TNL in tnl/.

Install: npx typed-nl init Repo: https://github.com/janaraj/tnl npm: https://www.npmjs.com/package/typed-nl

Happy to answer questions, especially from people who've tried plan-mode workflows and want to know where this differs.

The (other) problem with automatic conversion of free software to proprietary

Show HN: Run coding agents in microVM sandboxes instead of your host machine

My phone replaced a brass plug

AI is the new Oracle of Delphi. That's bad news

Open Source SaaS Is Dead; Long Live Open Source

A backup MX will get accessed by various sorts of people

HelloESP: A public website hosted on an ESP32

Incident with Multple GitHub Services

Where the Sweetest Margins Live in Jensen's 5-Layer Cake

The $150 Train to a $2k Seat: The World Cup of Price Shock

Supreme Court arguments make it clear that FCC fines are "nonbinding"

Book on building your own package manager in Rust

A good AGENTS.md is a model upgrade. A bad one is worse than no docs at all

Bikes keep Honda afloat, yet even that business is under pressure

The Origins of GPU Computing

Gluon&Linear Layouts Deep-Dive:Tile-Based GPU Programming with Low-Level Control [video]

China shipped a record 68 GW of solar in March – here's why it matters

870 EVO SATA 2.5 inch 2TB SSD

NCSC: Leave passwords in the past – passkeys are the future

Microsoft Vibing – capturing screenshots and voice samples without governance

Wading Through AI (Casey Muratori and Demetri Spanos)

Open Source and the Iceberg Theory

Anthropic's growing pains mount ahead of OpenAI showdown

30 Days Running ChatGPT Plus, Claude Pro, and Google AI Pro in Parallel

US Army announces new Combat Field Test to enhance Soldier readiness

Is Claude Code going to cost $100/month? Probably not–it's all confusing

Using an AI agent to navigate an undocumented Kubernetes repo

.genome: a genome file format designed for AI (Apache-2.0)

If America's So Rich, How'd It Get So Sad?

Why prediction markets are a sure sign that our civilisation is in decay