Writing High Quality Production Code with LLMs Is a Solved Problem

https://escobyte.substack.com/p/writing-high-quality-production-code

6•menzoic•1h ago

Comments

menzoic•1h ago

I work at Airbnb where I write 99% of my production code using LLMs. Spotify's CEO recently announced something similar, but I mention my employer not because my workflow is sponsored by them (many early adopters learned similar techniques), but to establish a baseline for the massive scale, reliability constraints, and code quality standards this approach has to survive.

Many engineers abandon LLMs because they run into problems almost instantly, but these problems have solutions. If you're a skeptic, please read and let me know what you think.

The top problems are:

* Constant refactors (generated code is really bad or broken)

* Lack of context (the model doesn’t know your codebase, libraries, APIs, etc.)

* Poor instruction following (the model doesn’t implement what you asked for)

* Doom loops (the model can’t fix a bug and tries random things over and over again)

* Complexity limits (inability to modify large codebases or create complex logic)

In this article, I show how to solve each of these problems by using the LLM as a force multiplier for your own engineering decisions, rather than a random number generator for syntax.

A core part of my approach is Spec-Driven Development. I outline methods for treating the LLM like a co-worker having technical discussions about architecture and logic, and then having the model convert those decisions into a spec and working code.

carrot5Top•1h ago

For sure, with the latest models, treating the model like a respected professional that needs context and input is essential. Usually I get the best results when the context window is right around 70% full.

menzoic•1h ago

> get the best results when the context window is right around 70%

I used to be trigger-happy with /compact or the handoff technique, where you transfer knowledge between sessions with a doc. But lately the newer generation of models seems to handle long context pretty well, up to around 20% remaining context.

But this is when I'm working on the same focused task. I would instantly reset it if I started implementing an unrelated task, even if there was 90% left, since there's just no benefit to keeping the old context.
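
Roughly, that rule of thumb could be sketched like this. It is only an illustration: the `Session` class, the `next_action` function, and the hard 0.20 cutoff are made-up names and values for this example, not part of any tool's API.

```python
from dataclasses import dataclass

# Assumed cutoff, taken from the "around 20% remaining" rule of thumb above.
COMPACT_BELOW_REMAINING = 0.20


@dataclass
class Session:
    """Hypothetical snapshot of one coding-agent session."""
    used_tokens: int
    window_tokens: int
    task: str

    @property
    def remaining_fraction(self) -> float:
        # Share of the context window that is still free.
        return 1.0 - self.used_tokens / self.window_tokens


def next_action(session: Session, next_task: str) -> str:
    """Return "reset", "compact", or "continue" before the next prompt."""
    # An unrelated task gets a fresh context no matter how much room is left.
    if next_task != session.task:
        return "reset"
    # Same task, but the window is nearly full: compact or hand off via a doc.
    if session.remaining_fraction <= COMPACT_BELOW_REMAINING:
        return "compact"
    return "continue"
```

The task check comes first on purpose: switching tasks resets even with 90% of the window free, while the remaining-context check only matters when staying on the same task.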

chalmers•1h ago

Yep! That’s almost exactly the workflow I’ve landed on too. I could not agree more

menzoic•1h ago

It's basically the typical SDLC boosted with LLMs. Especially the part where you can explore tradeoffs and alternative approaches rapidly.

Soupzzz•1h ago

I read you are using Codex and lost interest in the rest of the post

menzoic•1h ago

LOL, honestly I hated Codex when it first came out. It was backed by o3 at the time.

But literally as soon as GPT-5 came out in Codex with the "high" option, I completely switched from Claude Code to Codex. Never imagined that would happen so fast.

