Going full AI engineer, not touching code anymore

https://max.gp/writing/going-full-ai-engineer-not-touching-code-anymore/

39•maxheyer•49m ago

Comments

hootz•26m ago

But the question is, will this atrophy our ability to write and understand code? Is reviewing enough? Won't we inevitably get lazy and stop reading enough code?

Reading always was a worse way to learn than doing. When we stop doing, we will have to triple our reading (if that's even enough), but will we do that or just trust the agents more and more?

maxheyer•22m ago

Trust creep is the part I worry about too. Today I read every diff.

Glohrischi•21m ago

I think I read so much code over the last 15 years, that its not a thing i will lose.

But at least for me, I really read a lot of code, a lot of PRs from others and evaluated them.

jrflo•18m ago

I wonder if people thought something similar when moving from writing low level machine code by hand to high-level languages. No one looks at the actual machine instructions anymore! Will our ability to understand low level instructions atrophy?

hootz•13m ago

And they did atrophy! I only know assembly because I deliberately studied it and practiced it for reverse engineering. But in my view, no longer being able to understand high level code is a much, much bigger problem than not understanding assembly.

hedgehog•11m ago

There's clearly a range of outcomes, from the people pushing the envelope in system performance (look at the state of networking ASICs and the code to drive them) to people adopting hyper-complex frameworks for cloud orchestration and job coordination that turn a few screens of code into system so slow and complex it needs a team to maintain it. It's all happening at once, and it's probably ok. Keep in mind, knowing how to write assembly and knowing how to get good performance out of a CPU are two different things, modern chips are insanely complicated and the mental model is a long way from the days of the 386 or 68030 (let alone pre-MMU computing).

righthand•11m ago

This analogy is always weird to me, the change your talking about was still deterministic. Switching from programming to describing programming and having a machine make a best guess is not a deterministic shift as previously happened.

dijksterhuis•9m ago

[delayed]

snek_case•7m ago

The difference is that LLMs are not compilers. You can't trust the output to be correct. They routinely make bad design choices. If you're prototyping some kind of throwaway MVP, you're just sketching something, it's probably fine not to review it. If you're trying to build a piece of software that's going to survive for years, why are you doing this? The tech is clearly not yet fully mature.

Just yesterday, I was trying to use Claude Opus 4.7 to debug an issue in a program that I wrote, and its solution was to remove a feature, change the design to eliminate the problem without consulting me. I only found out that it had removed this feature through testing. Imagine not reviewing things like that. How can people argue for this with a straight face? We'll probably get there eventually, but there's no need to rush before the tech is ready. Doing that is just being clueless.

cguess•7m ago

It did atrophy, but more importantly understanding what's going on below the code you are writing, even if your project isn't assembly, also atrophied. When you're coding an Electron app with Javascript you're so far removed from any concept of the efficiency a modern CPU can achieve you stop caring about optimization. Speed and memory management don't matter and then you get... well, the modern web.

hootz•4m ago

And the modern web being usable is basically thanks to the people who understand lower level optimization.

linksnapzz•6m ago

The move to HLLs created new classes of problems that assembly programmers didn't face; so while the # of problems that necessitated understanding hw didn't shrink, it did relative to the list of things that could now go wrong.

4rachelp•25m ago

Feels AI-written as well and a recycling of 1000 versions of this point.

droidjj•19m ago

Yes, it reads like every goddamn LinkedIn post these days.

snek_case•6m ago

I wish the tech field wasn't so full of clueless grifters. The most frustrating thing is that this kind of people, the kind who loudly and confidently assert bullshit claim based on insufficient knowledge, have a knack for positioning themselves in positions of power.

jsdalton•14m ago

It’s aggressively AI written. I’d rather just read the prompt.

It’s unfortunate because many of us are going “full AI” when it comes to coding. And there are some true gives and takes that are interesting to explore.

Sadly, this piece reads like pure hype.

liveoneggs•25m ago

This would be fine if LLMs had a dedicated language or a well-tuned abstraction that suited their abilities well. Right now we are trying to fit our square languages into the LLM's round hole.

hootz•16m ago

Where does my trapezoid-shaped highly compressed and ambiguous natural language go? Well, of course, it goes in the square hole!

righthand•24m ago

I haven’t touched LLM coding beyond using TabNine originally and that has since been disabled. My LSP is good enough and I haven’t fallen behind on delivering code before deadlines. Not sure what the constant hype is about.

And look I wrote all that in 1 paragraph without bullet points detailing anything over and over.

7777332215•18m ago

People using google translate to speak a language. Except the language contains more complexity, meaning, and compression that the writer can't describe without using more words/tokens than it would be to actually write with the same precision. They could instead use better tools to achieve equal results with higher confidence.

ori_b•23m ago

Sounds like they're not touching writing either.

aselimov3•23m ago

I think the main issue with this approach is that your solution space eventually narrows. You go from formulating unique solutions to flagging things that look wrong and then just picking an alternative LLM generated one. I’ve noticed this recently as all my side project work is being done by hand. If I do ask an LLM another question about something small it will offer solutions but doesn’t offer the solution I think makes sense in the architectures I’ve written. Slowly your brain just gets trained to mid thinking like an LLM. At least that’s my theory and I think there was some study that showed that interacting with an LLM at the start of a problem significantly narrowed the solution space to something sub optimal

kbuchanan•18m ago

I agree this is a problem too, but I suspect mostly for novel(ish) software problems. For me, LLMs have expanded the solution space, because, while I used to be decent with SQL in Postgres, now I'm operating on a whole new level—the LLM's ability to make sense of Postgres' full suite of options, and the performance implications of the queries, is far beyond what I could have accomplished.

CharlieDigital•13m ago

It's a mixed bag.

I've been working with Apache AGE (openCypher in Postgres) recently and found that left to its own device, the agent wrote terribly inefficient queries, even when given a test harness and instructions to examine the result of the query plan.

It just didn't seem to understand the graph traversal, even when given the graph schema and small snippets.

I ended up hand-writing the structural "skeleton" of the main query that I performance tuned to a certain extent and then handed it over to Codex to finish off. Once it had this skeleton to start from, it was able to do a much, much better job of writing this query.

aselimov3•7m ago

That’s a good point but I think this article above would push back on this point slightly. I certainly am able to do a lot more with LLMs because it can produce a passable solution in a lot of places but I’m not sure it expands my solution space really. I tend to separate what I’m able to build and what I’m able to solve. I can build a frontend with an LLM but I don’t think I’m able to solve frontend engineering problems.

CharlieDigital•17m ago

    >  Slowly your brain just gets trained to mid thinking like an LLM

Regression to the mean.

I am doing a lot of the code reviews on my team and I can see that LLMs have a hard time (or are perhaps specifically guided) with OOP and writes a lot of `private static` utility functions. A lot of duplicated small utilities that can end up becoming a maintenance nightmare should the behavior need to be normalized/fixed. String key formatting, for example. JSON serialization behavior, another very common one. At a higher level, it needs very active guidance to search for existing code and re-use interface contracts via DI consistently (we have instructions and skills for this, but hit or miss on usage and adherence)

It generates very repetitive code and doesn't have the wits to refactor is in a way that is reusable, even in simple cases (basic JSON serialization).

It really dislikes to create object and type hierarchies on its own (e.g. move the repetitive serialization to a base class) and prefers to write one-offs. Works, but not very elegant; lots of duplication and touch points for regressions.

It also has a tendency to write more "verbose" solutions where sometimes simpler ones will work.

yehosef•14m ago

can you say which llms you're using? have you tried different ones and how were they?

CharlieDigital•11m ago

Team is a mix; I'm personally using Codex, gpt-5.5 high fast + Claude Opus 4.6 (occasionally Sonnet 4.6).

Mix of CLI (Codex) and GH Copilot (if I want active line selection).

We have a set of custom skills and knowledge base as well.

sebastianconcpt•9m ago

Predictable yeah. We're far better to find the right overarching narrative of the architecture and the necessary intermediate layers of abstractions. LLMs will invent verbal structs that sound okay but not have the elegance of a senior OOP. Although once the good foundation is there, agents can be great at extending and maintaining features on it.

spwa4•9m ago

This sounds very much like a problem of the context, that should be solvable by having a file with instructions on how to do generic utilities somewhere in the code (e.g. AGENTS.md)

CharlieDigital•5m ago

We have; but it's also not easily practical because there is some judgement involved and it's not really feasible to point out all of the edge cases (bloating the context).

What is not clear to me is whether this is inherent desirable behavior on the part of the agent or not. Why? Because for the agent, the code is more isolated and its immediate changes have a lower blast radius by internalizing some behavior (`private static`) versus touching a shared method or hierarchy.

I can see why the underlying models may be steered this way, but it creates a different kind of problem when things really should be shared.

maxheyer•16m ago

Yes, and LLMs are also very supportive, you get less critical feedback.

phito•16m ago

> If I do ask an LLM another question about something small it will offer solutions but doesn’t offer the solution I think makes sense in the architectures I’ve written.

This is my experience as well, and I've been using Claude Code a lot.

Extremely impressive tools, but they're like fast food. They will solve your immediate problems quickly and cheaply, but you're going to have issues on the long term if that's all you use.

njovin•4m ago

Have you tried drilling into the reasoning when this happens? This is why I usually leave it in "Plan Mode" and when it proposes a solution that seems unusual or unexpected, I point out why I think it is and ask it to justify it's position.

Sometimes I get the "you're right!" response, but often it will also explain why it made the decision it did, and it's rational enough that I accept the new approach.

It's still very much like a junior dev in this way - pretty good at 'just make it work', pretty good at monkey-see-monkey-do, and occasionally surprises you with something novel (to you).

hnthrow0287345•3m ago

Fast food might be the right approach if it's a business that prefers speed to maintainability, and they might move towards that with LLMs. I wish more developers would go with it instead of fight it since it's not like it's a personal failure you can't convince the business to slow down and prefer quality.

The quality of the craft can live on in open source and personal projects.

DeathArrow•10m ago

> You go from formulating unique solutions to flagging things that look wrong and then just picking an alternative

Not really. You write the specs, you write the architecture. You ask the model to implement your solution.

aselimov3•1m ago

But the specs can’t be fully detailed or else they would just be code which is the point the article above makes. You can certainly limit where the LLMs have freedom by writing more detail but obviously that defeats the purpose of LLMs. What I mean more broadly though is more along the lines of baked in solutions. The LLM will generally prefer certain architectures/libraries/patterns. If you develop a spec alongside an LLM (which I think most people do) you will get pushed towards those. You kind of close yourself off to other ways of solving a problem.

You can stop this by detailing exactly what solution you want but I think continuous leaning on the LLM will lead you into a confined space.

hibikir•10m ago

If you have a very specific decision in mind, there's no need to ask the LLM at all and hope it guesses it. Depending on how sure I am, I will either explain what I want vaguely, or even phrasing things as questions, or be direct and inflexible. It ends up writing the code I would.

It's not all that different from what happens if I am delegating a larger task to another human being. The only actual difference there is that wit a person, I'd rather underspecify as to give them ore of a sense of freedom and opportunity for growth, while the LLM isn't learning anything, so, if anything, it's easier.

dboreham•10m ago

I'm not sure about this. I think it's much the same as when you work in an organization comprising humans, and many of them are not so right about everything. Perhaps this will be familiar :)

I've noticed that Claude with the current model can act most of the time like a trusted senior engineer sounding board for me. However on occasions we disagree. Recently we had a stand up argument about a proposed solution to a potential security risk in a system. I realized Claude had taken on a kind of "corporate culture" thinking on the problem (which was to do with use of a vault/broker to hold user credential data) that would be appropriate in some contexts, but wasn't in ours. The experience was totally like arguing with someone on the corporate Slack, but in the end I control what gets deployed so I won. But I'd say 90% of the time I get as much value out of Claude on these discussions as I would if I knew one competent person for each of the subjects we discuss, and had called that person for advice. Of course even though my network is...vast...I just don't know a competent person in every single field.

supermatt•18m ago

People bragging that they "dont touch code" and only "argue" with agents are reinventing the slowest possible IDE.

Obviously the agents are great at producing large chunks of code, but they often make minor and sometimes trivial mistakes which need amending.

Typing something like "in src/auth/session/token_manager.ts the refreshTokenExpiry variable should be refresh_token_expiry. update every reference and make sure nothing else changes" and waiting for the LLM to do its thing takes longer than opening the file and doing the rename yourself.

If you are describing microscopic edits in natural language you are not avoiding coding. You are coding through an extremely verbose, lossy interface with higher latency and lower precision.

EDIT: flagged?

qaq•4m ago

Codex and CC are actually getting better at reviewing code and flagging issues. False positive rate dropped fairly significantly. Also obviously might be very personal preferency but creating clear specs and iterating on specs really helps to crystalize the approach I want to take to solving a given problem.

eranation•17m ago

I’ve been on and off and on the bandwagon on this.

My current verdict. When used properly, and if you actually read the code, pushback when needed and don’t get approval fatigue, and also practice hand coding here and there, you can definitely get immense productivity boosts while not compromising on quality. The issue is blindly accepting AI code, architecture, “are you sure this is prod ready?” “Ok ship it” is not going to end well. Too many times I caught the top SOTA models, 1M context, effort on max, suggest or do terrible things and say it with confidence.

TLDR if you read the code (and understand the architecture), and know when to push back. This can work amazingly well.

Otherwise you will gradually build a Rube Goldberg machine.

iririaqq•11m ago

How many people actually have this discipline?

I think most people dont want that discipline either. engineers in the west live in a world of convenience. The loss of discipline is almost expected. Apart from those who really love it and take pride.

And we’ve seen that outsourcing to low cost regions = no discipline.

Nesco•17m ago

After a time the codebase feel foreign…

JBorrow•17m ago

I just can't get behind this perspective, perhaps because I am a 'bad prompter', simply due to the lack of capability from current models. I end up rejecting them, asking them to implement again, they spew out tons of crap code instead of a 10 line fix, and on, and on.

Note that I'm not saying the agents are useless. They certainly write a lot of code and sometimes it is good. But I don't think you can get away without touching code yourself simply because at some level it is often the most concise way to get your idea across. I find it best to implement a core set of changes by-hand, and then ask the system to e.g. replicate them to other dependencies. I genuinely don't understand how anyone can work on a system with any complexity, and get the results they actually want, without touching code.

WesolyKubeczek•16m ago

Feels like what one of those morbidly obese people riding electric scooters would write about not missing walking one bit. God forbid they end up in an area where accessible alternatives to stairs are not a thing.

voidUpdate•15m ago

What's interesting to me is how many people have found out with the LLM boom that they don't want to be developers/engineers, they want to be managers, delegating all the work to another entity and checking in occasionally to see how the coding is coming along

DeathArrow•7m ago

I do not manage LLMs, I manage people.

I always considered myself a builder, a creator. The end result is interesting and it's important for me the end result to be just as I envisioned it.

Typing code isn't exciting. Thinking, planning, finding solutions and driving the LLM to implement the finished product is.

voidUpdate•2m ago

I'm happy for you that you enjoy being a manager. I'm just saying that a lot of people seem to have found they prefer managing another system writing the code for them asking it nicely to do things, rather than writing code myself. I'm in the camp of enjoying typing code, and I do not want to become a manager, either of LLMs or people

chrisrickard•3m ago

I’ve found it interesting too.

It’s like I spent 20 years mastering painting in watercolour… nerding out on other painters, canvas options, even the backstory on some guy that makes a specific paint etc.

And I don’t regret any of that. but now I’m just loving creating my art 100x faster.

I thought I loved the craft (and I did) but more, I loved the product.

scotty79•1m ago

You seem to think you can only be an engineer if you are holding the shovel yourself.

Building software consists of many parts. I love each individually. I was always a bit dissatisfied about how they interact and how they block each other. I love puzzle solving, I love debugging, but I kinda hated building software because it depended on those elements and I'm not fond of context switching.

I can love building software now.

pavel_lishin•13m ago

Isn't that just being a manager?

the__alchemist•12m ago

Everybody knows you don't go full AI engineer.

ssenssei•2m ago

You know any AI war heroes? You went full AI engineer, man. Never go full AI engineer. You don't buy that? Ask Roy Lee, 2025, "Cheat On Everything." Remember? Went full AI engineer, went home empty-handed.

mariopt•11m ago

I’m forcing myself to write code.

This idea of reviewing an architecture that you never coded is just a fantasy.

At some point in time, me and a lot of people, thought that using Redux was a great idea until we had to manage verbosity and middlewares. Now we had to deal with the consequences of our decisions and we learned.

I also think this article is just a rage bait

yehosef•9m ago

can an architect review a system he didn't design or build?

yehosef•10m ago

With all the people saying that you're going to have problems because the LLM is not good at refactoring or large code bases or OOP, etc. the point may be that if you're working to develop skills, an LLM herder might be a good one. Even if the models are almost, but not quite, good enough yet - they will be.

When looking for a career move you might want to focus on the trajectory more than the current state.

everyone•10m ago

Is anyone doing this who is not making CRUD web-apps or other very common types of programs for which there is masses of training data to copy from the net?

I tend to do oddball stuff (eg. software for custom arcade machines, AR games) and any forays I've made into AI code have made it seem like a total waste of time for me. (ie. I will spend much longer trying to get AI do to a shitty job, compared to just doing it myself)

the__alchemist•3m ago

It is a minefield for structural bio/chem.

lenerdenator•9m ago

We're basically doing the same thing where I'm at.

I do see the long-term goal as being the elimination of most engineering positions. If you can have an architect - or, even better, some solution manager talking directly to the customer - just prompt a solution into existing and monitor most of it, you can really reduce the headcount.

add-sub-mul-div•8m ago

Most of the AI submissions are from people who are here to self-promote their own content and have little to no other participation. It's like this stuff appeals to a different audience than made up this site 5 years ago. Everything and everyone feels so nonsubstantive now.

fnoef•7m ago

"The fun part was never lifting weight. The fun was in being buffed with a 6-pack and picking up women. The fun is in bullying those who are weaker than me. I never liked going to the gym and lifting weights. What I actually love is the result of the lifting, but I never wanted to do the hard work".

Not saying this is a 1:1 parallel, or that writing is what makes you a good coder, but I do wonder - if you remove the entire process of making something, what is left there? Sure, people keep arguing that architecting or "managing" AI agents is the actual moat, but is this really? If you do not participate in the craft itself, how would you get better?

I mean, I have amazing ideas for wooded furniture I'd like to build. I doubt that if you give me an LLM connected to a router that I would be able to build something by "architecture, review, push back, and argue". I simply lack the knowledge to make something out of wood. And the only way to get this knowledge is to go and make something and not "architect, review, push back, and argue".

the__alchemist•6m ago

The author appears to have gone full AI blog writer as well.

Havoc•5m ago

Same here. It’s working well, though only using it for personal projects.

The one project that did give me pause was an S3 Server. Not entirely sure I trust that…

Toying with the idea of sticking a proxy in between that runs the vibe coded S3 in parallel to a classic server and see if the results track

lysace•4m ago

My career:

Year 0-12: Coding lots, learning lots.

Year 12-22: A product I built really took off. Building via other enginers. Lots of product management/politics. Atrophying coding skills.

Year 22-28: I'm tired of building by talking. Re-learning coding.

Year 28-: "Here's this thing called Claude Code".

I wonder if there'll be another cycle that involves someone like me.

reeredfdfdf•3m ago

The problem I have is that I find generic code reviews really, really boring. With agents I write less code but read more code that I didn't write myself, which makes the job more boring than it was.

I would rather let AI do the code reviews and focus on test coverage.

NATO Has Seen the Future and Is Unprepared

Linux Foundation finds AI security readiness is now the top obstacle to adoption

How Storm-2949 turned a compromised identity into a cloud-wide breach

Show HN: GobanFTP – a game of Go that lives in FTP listings

Show HN: Design.md Taste Generator

Claude Managed Agents on Cloudflare

Floats are dead Long live floats

Our Journey into Social Exile. X, Threads, Reddit and LinkedIn Ban

Show HN: Guitar Guru – A guitar valuation app using ML

Why is almost everyone right-handed?

Show HN: Efficient Modeling of Nonlinear Dynamics in Nonstationary Data Streams

Licensing Personal Projects

Ur Dream Founding Engr?

The Stack Behind Clicky

Writes and Write-Nots

The Billionaire Who Wired San Francisco

Tristan Davey's Punch Card Archive

Show HN: CLI Security for Agentic Development

New Surface devices built for business and AI acceleration

Show HN: Nexus – free, self-hosted business suite for service teams

New syslog formatter for Golang (RFC 3164/5424/6587 compatible)

VIX

Microsoft Delta

Ask HN: What are your plans for the AI future?

Show HN: RoBrain – Shared memory for AI agents, with rejected alternatives

What Copy Fail looked like on a minimal OS

Show HN: WiFi Heatmap – free, browser-based predictive WiFi planner

There's no such thing as "age verification"

Chart: How Tech Giants Make Their Billions (2017)

Powertracker, 29 GW of announced AI capacity mapped to host counties