"But when I'm honest with myself, I can't point to evidence that an LLM writes worse code in a messy repo. And that bothers me. I've watched agents make correct changes in files I'd be afraid to touch."
If the author is afraid to touch the files, how does he know the changes are correct?
It is very tempting to believe code is a pure means to an end, and that the end can be exhaustively represented by some combination of a test suite, a prose technical design, etc. And in a single round game version of development, that gets close to true. But in the multi-round situations that matter economically, some bodies of code act as the measure for other bodies of code. This is most evident when we see generated code fail to typecheck.
Writing non-trivial software is not an exercise in translating a clearly defined end into precise means. It is solving a series of economic coordination problems over time.
bwestergard•38m ago
If the author is afraid to touch the files, how does he know the changes are correct?
It is very tempting to believe code is a pure means to an end, and that the end can be exhaustively represented by some combination of a test suite, a prose technical design, etc. And in a single round game version of development, that gets close to true. But in the multi-round situations that matter economically, some bodies of code act as the measure for other bodies of code. This is most evident when we see generated code fail to typecheck.
Writing non-trivial software is not an exercise in translating a clearly defined end into precise means. It is solving a series of economic coordination problems over time.