It's also janky as hell and crashes regularly.
An AI startup could still be a useful "resume" to get acqui/hired by one of the big players.
I don't need God in a datacenter. I need help diagnosing an Elastic Search problem.
On the other hand, there's a ton of hype and money looking for the next AI related thing. If someone creates the next transformer, or a different AI paradigm that pushes things forward, they'll get billions.
Also, "model innovation" strikes me as missing the point these days. The models are really good already. The majority of applications is capturing only a tiny bit of their value. Improving the models is not that important because model capability is not the bottleneck anymore, what matters is how the model is used. We just don't have enough tools to use them fully, and what we have is not even close to penetrating the market, while all the dominant tools are garbage. Of course application innovation is the place to be!
Frankly it's bordering on irresponsible to not be targeting acquisition in this climate.
2) ?
3) Profit!
Anthropic and xAI will also make similar acquisitions to increase their token usage.
It's the best time ever to build. Don't work on anything that could have been done two years ago.
Learn the current tools - so that you can adapt to the new tools that much faster as they come out.
no_wizard•3h ago
Model innovation is effectively converging and slowing down considerably. The big companies in this space doing the research are not making leap over leap with each release, and the downstream open source projects are coming closer to the same quality or in fact can produce the same quality (e.g DeepSeek or LLAMA) hence why it’s becoming a commodity.
Around the edges model innovation - particularly speed ups in returning accurate results - will help companies differentiate but fundamentally, all this tech is shovels in search of miners, IE you aren’t really going to make money hand over fist by simply being an LLM model provider.
In another words, this latest innovation has hit commodity level within a few short years of going mainstream and the winners are going to be the companies that make products on top of this tech, and as the tech continues to become a commodity, the value proposition for pure research companies drops considerably relative to application builders.
To me this leaves a central question: when does it hit a relative equilibrium where the technology and the applications on top of it have largely hit their maximal ability to add utility to applicable situations? That’s the next question, and I think the far more important one
One other thing, at the end of the article they wrote:
>Ultimately, businesses won’t rearrange themselves around AI — the AI systems will have to meet businesses where they are.
This is demonstrably untrue. CEOs are chomping at the bit to reorganize their business around AI, as in, AI doing things humans used to do and getting the same effective results or better, thereby they can reduce staff across the board while supposedly maintaining the same output or better.
Look at the leaked Shopify memo for an example or the trend of “I can vibe code with an LLM making software engineers obsolete” that has taken off as of late, if LinkedIn is to be believed
epistasis•3h ago
nemomarx•3h ago
babelfish•3h ago
skeeter2020•2h ago
Businesses are definitely rearranging themselves structurally around AI - at least to try and get the AI valuation multiplier and Executives have levels of FOMO I've never seen before. I report to a CTO and the combination of 100,000 foot hype combined with down in the weeds focus on the "protocol de jour" (with nothing in between that looks like a strategy) is astounding. I just find it exhausting.
adpirz•2h ago
It is still simply too early to tell exactly what the new steady state is, but I can tell you that where we're at _today_ is already a massive paradigm shift from what my day-to-day looked like 3 years ago, at least as a SWE.
There will be lots of things thrown at the wall and the things that stick will have a big impact.
dingnuts•2h ago
oh except, sometimes someone tells me I could use the bot to generate a thing, and it doesn't work, and I waste some time, and then do it manually.
vonneumannstan•2h ago
You're just showing how disconnected from the progress of the field you are. o3/o4 aren't even in the same universe as anything from open source. Deepseek R1, LLama 4? Are you joking?
no_wizard•2h ago
While certainly newer models are more capable on the whole, it doesn't mean I need all that capability to accomplish the business goal.
vonneumannstan•1h ago
no_wizard•1h ago
Again, its not about capabilities alone, (on this, many models lag behind, I already said as much). I follow these developments quite closely, and I purposely said results as to not say they're equivalent in capability. They aren't.
However, if a business is getting acceptable results from older models or cheaper models than capability doesn't matter, the results do. Gemini 2.5 can be best of breed but why switch if it shows no meaningful improvement in results for the business?
If I need more capability or results are substandard, I can always upgrade, but its like saying there's no room for cheaper processors and you'd be out of your mind not to be using only the latest at all times no matter the results.
luckylion•45m ago
> If I need more capability or results are substandard, I can always upgrade
You wouldn't be able to upgrade (and see improved results) if the model you use today was close to equal to the top of the line.
no_wizard•28m ago
Der_Einzige•1h ago
Switch them to good samplers and write the tool calling code to allow tool calls in the reasoning chain and you’ll see close to parity in performance.
The remaining advantages left to closed source come from better long context, and later data cutoff points.
If you don’t believe me let’s see the receipts of your ICLR or NeurIPS publications - otherwise sit down and listen to your elders.
bongodongobob•2h ago
Nah. Maybe tech CEOs. Companies are blocking AI carte blanche at the direction of their security teams and/or only allowing an instanced version of MS Copilot, if anything. Other than write emails, it doesn't do much for the average office worker and we all know it.
The value is going to be the apps that build on AI, as you said.
borski•2h ago
What companies?
epistasis•1h ago
no_wizard•1h ago
borski•27m ago
no_wizard•1h ago
Any company with any sort of large customer service presence are looking at AI to start replacing alot of customer service roles, for example. There is huge demand for this across many industries, not only tech. Whether it actually delivers is the question, but the demand is there.
warkdarrior•1h ago
o1inventor•2h ago
Model providers and model labs stop opensourcing/listing their innovations/papers and start patenting instead.