So as I was saying in [0] and [1], there is no doubt that properly tuning the compiler for performance can make a significant real difference instead of wasting more money and risking an increase in costs just by throwing more servers at the problem.
Also, If you needed to re-architect the entire codebase to solve a performance issue, either you chose one of the most inefficient technologies / languages or the code itself was badly architected in the first place or both.
Before any architectural changes to the codebase first check if you can get performance gains from the compiler flags and measure it. That should be the industry standard practice for high quality efficient software.
We must learn from excellent SWEs teams such as DeepSeek which frankly embarrassed the entire AI industry due to their performance optimizations and savings in inference usage.
rvz•1h ago
Also, If you needed to re-architect the entire codebase to solve a performance issue, either you chose one of the most inefficient technologies / languages or the code itself was badly architected in the first place or both.
Before any architectural changes to the codebase first check if you can get performance gains from the compiler flags and measure it. That should be the industry standard practice for high quality efficient software.
We must learn from excellent SWEs teams such as DeepSeek which frankly embarrassed the entire AI industry due to their performance optimizations and savings in inference usage.
[0] https://news.ycombinator.com/item?id=43753443
[1] https://news.ycombinator.com/item?id=43753725