frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Writing High Quality Production Code with LLMs Is a Solved Problem

https://escobyte.substack.com/p/writing-high-quality-production-code
6•menzoic•1h ago

Comments

menzoic•1h ago
I work at Airbnb where I write 99% of my production code using LLMs. Spotify's CEO recently announced something similar, but I mention my employer not because my workflow is sponsored by them (many early adopters learned similar techniques), but to establish a baseline for the massive scale, reliability constraints, and code quality standards this approach has to survive.

Many engineers abandon LLMs because they run into problems almost instantly, but these problems have solutions. If you're a skeptic, please read and let me know what you think.

The top problems are:

* Constant refactors (generated code is really bad or broken)

* Lack of context (the model doesn’t know your codebase, libraries, APIs, etc.)

* Poor instruction following (the model doesn’t implement what you asked for)

* Doom loops (the model can’t fix a bug and tries random things over and over again)

* Complexity limits (inability to modify large codebases or create complex logic)

In this article, I show how to solve each of these problems by using the LLM as a force multiplier for your own engineering decisions, rather than a random number generator for syntax.

A core part of my approach is Spec-Driven Development. I outline methods for treating the LLM like a co-worker having technical discussions about architecture and logic, and then having the model convert those decisions into a spec and working code.

carrot5Top•1h ago
For sure, with the latest models, treating the model like a respected professional that needs context and input is essential. usually I get the best results when the context window is right around 70% full
menzoic•1h ago
> get the best results when the context window is right around 70%

I used to be trigger happy with /compact or using the hand off technique to transfer knowledge between sessions with a doc. But lately the newer generation of models seem to be handling long context pretty well up to around 20% remaining context.

But this is when I'm working on the same focused task. I would instantly reset it if I started implementing an unrelated task. Even if there was 90% left, since theres just no benefit to keeping the old context

chalmers•1h ago
Yep! That’s almost exactly the workflow I’ve landed on too. I could not agree more
menzoic•1h ago
It's basically the typical SDLC boosted with LLMs. Especially the part where you can explore tradeoffs and alternative approaches rapidly.
Soupzzz•1h ago
I read you are using Codex and lost interest in the rest of the post
menzoic•1h ago
LOL, honestly I hated Codex when it first came out. It was backed by o3 at the time.

But literally as soon as GPT-5 came out in Codex and with the "high" option, I completely switched from Claude Codex to Codex. Never imagined that would happen so fast.

A visual summary of the 5 prerequisites for improvement

https://mental-models.oldschoolburke.com/five-prerequisites/
1•zdosb•1m ago•1 comments

Zwasm: A fast, spec-compliant WebAssembly runtime written in Zig

https://github.com/clojurewasm/zwasm
1•jedisct1•1m ago•0 comments

Americans are destroying Flock surveillance cameras

https://techcrunch.com/2026/02/23/americans-are-destroying-flock-surveillance-cameras/
1•mikece•2m ago•0 comments

Life at the Frontlines of Demographic Collapse

https://www.lesswrong.com/posts/FreZTE9Bc7reNnap7/life-at-the-frontlines-of-demographic-collapse
1•reducesuffering•4m ago•0 comments

I analyzed hundreds of humans vs. AI Tetris games, here's what I found

https://www.a16z.news/p/i-built-tetrisbench-where-llms-compete
1•ykhli•4m ago•0 comments

Real-time security reasoning inside your IDE

https://open-vsx.org/extension/DevSecAI/Arko
1•mlnas•4m ago•1 comments

Fuss: OverlayFS Without Mounting

https://writethat.blog/fuss.html
2•psarna•7m ago•0 comments

Alleged Distillation Attacks by DeepSeek, Moonshot AI, and MiniMax

https://twitter.com/anthropicai/status/2025997929840857390
5•mike_kamau•8m ago•0 comments

ESR posits that the C-era is reaching its natural conclusion

https://twitter.com/esrtweet/status/2026004594590089484
2•sgt•12m ago•0 comments

Show HN: Emotica – AI that analyzes your emotions instead of just tracking them

https://apps.apple.com/us/app/emotica-mood-tracker-diary/id6757162931
2•tirupati_balan•12m ago•1 comments

Muscle Cathepsin B Improves Neurogenic Deficits in Mouse Alzheimer's Disease

https://onlinelibrary.wiley.com/doi/10.1111/acel.70242
3•bookofjoe•13m ago•0 comments

Show HN: I rebuilt my hobby mapping platform

https://trippi.app
2•velmu•14m ago•0 comments

Waymo Is Destroying Tesla's Self-Driving Dreams

https://neuralfoundry.substack.com/p/waymo-is-destroying-teslas-self-driving
4•truenfel•17m ago•0 comments

Anthropic: Industrial-scale distillation attacks on our models by Chinese AI

https://twitter.com/i/status/2025997928242811253
6•mudil•17m ago•1 comments

Neural Correlates of Envy and Schadenfreude

https://www.science.org/doi/10.1126/science.1165604
2•toomuchtodo•18m ago•1 comments

One Lib to Rule Them All: Why we build oneringai open source agentic AI library

https://medium.com/superstringtheory/one-library-to-rule-them-all-why-we-built-oneringai-689f9048...
2•jhoxray•18m ago•0 comments

Issues with "C99 implementation of new O(m log^(2/3) n) shortest path algorithm"

https://github.com/danalec/DMMSY-SSSP/issues/1
2•dunmalg•22m ago•0 comments

The Future of Social Media Is Human

https://blog.picheta.me/post/the-future-of-social-media-is-human/
1•dom96•23m ago•0 comments

AWS suffered 'at least two outages' caused by AI tools

https://www.tomsguide.com/computing/aws-suffered-at-least-two-outages-caused-by-ai-tools-and-now-...
2•randycupertino•23m ago•2 comments

Show HN: MachineAuth:open source Google login for your AI Agent

https://github.com/mandarwagh9/MachineAuth
2•mandarwagh•24m ago•0 comments

Is this cloud/local boundary for trading infra reasonable?

3•Sultan_Custodia•24m ago•0 comments

Zoye – The First AI Native Workspace for All Your Business Tools

https://zoye.io/
3•anizeu•24m ago•1 comments

The British get a nosebleed when they get too successful

https://www.reaction.life/p/the-british-get-a-nosebleed-when
2•ossa-ma•26m ago•0 comments

Liver exerkine reverses Alzheimer's-related memory loss via vasculature

https://www.sciencedirect.com/science/article/pii/S009286742600111X
6•PaulHoule•29m ago•0 comments

Show HN: Shibuya – A High-Performance WAF in Rust with eBPF and ML Engine

https://ghostklan.com/shibuya.html
4•germainluperto•30m ago•0 comments

The Era of AI human clone

2•Metalcode•30m ago•0 comments

Show HN: I built a tool track cash flow without the "spreadsheet stress"

https://www.opboard.io/
2•wwxoxo•30m ago•1 comments

Baudbot: Always-on AI assistant for dev teams

https://github.com/modem-dev/baudbot
2•tosh•32m ago•0 comments

Why Frederick Wiseman Was the Greatest Documentary Filmmaker Ever

https://www.newyorker.com/culture/the-front-row/why-frederick-wiseman-was-the-greatest-documentar...
2•mitchbob•32m ago•1 comments

Anthropic announces proof of distillation at scale by MiniMax, DeepSeek,Moonshot

https://twitter.com/anthropicai/status/2025997928242811253
28•Jimmc414•33m ago•19 comments