My hypothesis is: it's designed that way to generate more profits.
My hypothesis is: it's designed that way to generate more profits.
Because, if you had a machine that could automatically generate excellent code, would you sell access to it, or would you use it to put every other software company out of business?
First, Hanlon's Razor: Never attribute to malice that which is adequately explained by stupidity.
Second, imagine how you train your model. You want a huge selection of quality writing. If you were to input Twitter posts into your model, your model will be dumb as rocks. So you put in literature. Science articles. Technical documentation. If you're Anthropic, you pirate all those books.
The model you get out then is very high quality. Then you ask it to output text and it is going to output this average tone, style, quality from centuries of writing. The text you're getting is 2% shakespearean, 4% old english, 5% japananese english translated manga.
You need to ask it for exactly what you want. If you dont, it will be confusing.
rlv-dan•2h ago
This implies that available training code is good. Consider for example how many GitHub repos are students just trying to learn to code.