It's like someone arguing that cheese isn't real. Yes I can go to the grocery store and take a picture of cheese and show it, but what's the point? They can live in their own world. It doesn't change any of our lives. The world is what it is.
And for those who are all "but dur CCP get all ur data" you can use things like AWS Bedrock (at least for earlier versions of Deepseek and Qwen for now) and have more familiar people get all your data. Or buy (at obnoxiously inflated prices) your own HW and not send your data to anyone.
The funniest part of this is that people are often talking about how LLMs are now writing 100% of their code, then also saying that they don't want to expose their code to foreign government exfiltration by using foreign models.
But, uh, if an LLM is writing 100% of your code you have no actual secret sauce to hide from anyone, so why worry about it.
Meanwhile, like I think you suggest, I would assume everyone can generate similar outputs themselves. The idea that you can claim priority on your dream prompt and lock up the market on prompt responses sounds delusional to me. It's not novel invention when you're spit-balling at the same level of abstraction as every fantasy/scifi writer who ever was.
So I also have doubts about the sustainable business model. How long will it take for this fantasy to unravel, as people discover they cannot monetize their AI outputs as much as they dreamed, and in turn cannot afford to pay the AI services they use?
My absolute nightmare is that this becomes a "too big to fail" thing and oppressive/fascist governments decide to back full regulatory capture. That instead of letting it unwind, they grant and support enforcement of an increasingly absurd and arbitrary copyright/patent regime to support this monetization scheme.
At this point I'm trying to believe there's a middle ground where the level of individual capability this unlocks, leads to major discoveries.
However, most of the engineers I respect have gone from being skeptics a year ago to convinced today. I don’t personally know any true holdouts any more. If there are studies that disprove productivity gains more than six months ago, I’m happy to believe that it was true of the AIs that were available at the time. But I’m going to need something much more recent before I disbelieve my lyin’ eyes where it pertains to the AIs available today.
Instantly close the tab as soon as the popup to subscribe to his newsletter pops up.
The current wave of AI unlocked language - the tools are now speaking and understanding. This, on its own, is astonishing progress. Language is the foundation of our culture and society; it is the very technology that got us, as a species, to where we are today. To have tools that can understand, manipulate, and produce it is a massive leap forward.
Once you see things that way, it is clear that we are not in a bubble; we are in a transition. Yes, there is tons of hype and over-investment, but the demand is real, and so is the impact. Unless you are deep in the tech and have that structural depth, it is easy to dismiss. This is like the invention of the personal computer, but with 100x the impact and speed.
The angry polemic that goes on and on and on with cuss words used liberally is just meant to evoke emotion and cathartic resolution to the type of people mentioned above. Not truth.
The thing is, there are a lot of people that find comfort in what he’s writing - primarily because it’s a coping mechanism against how quickly things are moving and a way to deal with being left behind. When you spend time, years, building institutional knowledge and making a whole identity out of it, you obviously will feel bad with the threat of it being commoditised.
I would write against the content of the article but I find it easier and more illuminating to write what he has said before instead. Then it shows how incorrect the guy has been and with what confidence he keeps speaking with.
-------
> While complex, generative AI is a technology that probabilistically generates answers, and has no "intelligence." It is inherently limited by its architecture, and in turn can only get "better" in a linear fashion. I see no signs that the transformer-based architecture can do significantly more than it currently does.
He wrote this in 2024 before reasoning models came out. Remember how ChatGPT was in 2024? Do you think this person is someone who gets predictions right?
> Furthermore, I hypothesize a race to the bottom in generative AI will significantly hamper OpenAI's ability to expand revenue, compounded by the fact that we're approaching the limits of transformer-based architecture.
He wrote this in 2024 and since then Anthropic's revenue increased by 160x to $40 B dollars a year and OpenAI's increased by 6x. Do you think this person gets predictions right still?
> I believe we're reaching the upper limits about what generative AI can do and how accurate its outputs can be,
He wrote this in 2024, do you really think we have reached upper limits? Huh?? What I'm using today is significantly more accurate and 2 tiers above what we had.
> And if there are true industry-changing possibilities waiting for us on the other side, I am yet to hear them outside of the fan fiction of Silicon Valley hucksters.
He says this about AI when we have with all honesty have had industry changing possibilities like agentic coding.
> There are indications that consumers have also lost interest. As pointed out by Alex Kantrowitz’ Big Technology newsletter, traffic to ChatGPT on both mobile and web has started to stagnate, if not decline. In January 2024, ChatGPT had 1.6 billion visits — 11% below the all-time peak of 1.8 billion. This makes it only modestly more popular than Bing, which had 1.3 billion unique visits during that period. On the mobile front, ChatGPT has an estimated 6.3 million US users — or 1.7 times less than the total of new Snapchat users added during Q4 2023.
He agrees with the claim that the consumer interest has declined. Since he said this, there was a 9x growth in active users.
-----
https://www.youtube.com/watch?v=_wStScmT748&t=1s
"AI Bubble Already Bursting?" (8 months back)
https://www.youtube.com/watch?v=T8ByoAt5gCA&t=1s
"A.I bubble is bursting with Ed Zitron" (1 year back)
He's been constantly crying bubble for years now.
-----
> AI video won’t get truly fixed just by waiting a year.
This is what he had said in 2024, and you just need to compare video from then and now to check whether the predictions came true. Why would anyone trust what this guy has to say?
Edit:
> If you’re wondering what the story is, [...] I expect it to be out in the next two weeks [...] I can guarantee you it’ll be worth it, and you’ll be stunned by what I report.
Ok, this takes clickbait to new lows. The headline is trying to sell the teaser here, with very limited meat in the middle of the sandwich.
Maybe AI is different. Certainly, the level scale of investment is on a different order of magnitude. But I'm wary of believing anything about the financial impossibility of AI being sustainable when I've seen such similarly confident arguments proved wrong in the past.
It's a pretty classic business strategy, and not directly comparable to any of the AI companies. There's a reason people compare the current situation to the dotcom era and not Uber. Also, don't take Uber as an example of a slam-dunk VC success story and leave it at that -- plenty of dumb ideas get pitched and funded and go bankrupt for every Uber.
It was only because Uber successfully bulldozed over all regulations that it was able to succeed ... and that was hard to predict before it happened.
Coding seems to be one of the core use-cases for LLMs (as Simon Willison pointed out recently) and even if that's the only real use-case for LLMs, they're wildly useful. I do understand that useful != profitable and that's where I think Ed has a real point: until inference becomes much cheaper these companies cannot be profitable. Some mega-players will pay the API token price, but most will not.
If the AI companies need $X billion in revenue to stay afloat, it doesn't matter if 0.5% or 5% or 50% of that revenue is from transforming the State of the Art. It's 100% irrelevant: what matters is that, transformation or no, these companies won't have the income to pay their bills. And if they can't pay their bills, a whole lot of other companies can't either.
So again, transformation or no, it's still a house of cards waiting to collapse. The only thing that would change that is not more "transformation" ... it's a feature set that lets them multiply their current user base (or multiply how much they charge them) several times over.
I find it quite refreshing in some ways. Lots of people, when they start complaining about this or that aspect of this AI stuff, are wont to add in a little disclaimer that, despite all of the above, they actually really like AI and use it all the time. I assume this is to avoid the scenario of a bunch of pragmatic builders turning up and calmly shipping nuance in the comments (or whatever you call it these days when you get brigaded by a pile of angry keyboard warriors with chips on their shoulder) - and it sure is tiring having to wade through the equivocation.
That's a criticism that'd be hard to level at Zitron! Say what you like about the man, but he's unafraid to appear to take a side.
That said, I think his voice is useful as a counter to the mainstream opinion.
Given the amount of investments, approaching AI from the angle of economics seems correct.
We all have some level of personal experience using AI/LLMs, both chatbots and coding tools, and I personally enjoy using them, but I am sure this experience is relevant in this discussion.
I also enjoy luxury hotels, gourmet food, jet skis and helicopters, but this is not something I indulge in often because of the cost-utility ratio.
The real cost of AI may or may not be lower than its utility. The bet is that utility is increasing while cost is falling.
> It's like someone arguing that cheese isn't real
I agree with your first statement (any being you) because of your second statement.
Here is the report:
https://www.faros.ai/blog/ai-acceleration-whiplash-takeaways
And my commentary:
EDIT: In fact, parent comment has a link to some numbers.
[EDIT: Most] people don't want to go through the numbers. Ok. But there's a history here. When people don't want to see the numbers, certain kinds of things tend to happen.
Code acceleration is great, but.... something precedes that. Vision and strategy re. expansion of offerings and businesses. Once a firm reaches maturity in what it offers and is only touching the edges - this code acceleration is literally useless when you factor in all of the trade-offs.
This is a good thing - it means fat and slow incumbents are sitting ducks to be out-witted by creative and imaginative founders, which is healthy for a well-functioning economy.
Now the economics of existing frontier models are not sustainable - its looking like a mix of the airline (supersonic vs subsonic) and EV industry with China in the background providing decent offerings at much lower prices.
It's not that the utility of it put in question. What is however a giant question mark is how the heck any of the big AI companies are ever gonna get that ROI? Given how many of us are becoming more and more fine with local models that run just fine especially on a good enough computer which most developers have anyway...
Why should someone pick Opus 4.8 when Qwen3.7 Plus produces similar results for about 1/20th the cost.
That sort of pricing disparity is across the board. But further it's becoming more and more apparent that they are doing more with less parameters. That's what's giving the local models their super powers.
This, combined with his extreme ignorance, makes him unreadable. The only reason people read his stuff is because it validates and confirms their own anti-AI beliefs. It's why every time he publishes an article, it reaches the front page in an hour or less.
Extreme ignorance?
How are they undeniable? They're very deniable. One example is the (seemingly) increasing maintenance costs for AI-generated code[1]. Another is the cost incurred by everybody reading AI slop instead of actual communication.
I don't have hard data as to whether these cancel out the benefits, but it's not as rosy as some seem to think.
[1] After years of people understanding that LOC is not only a poor productivity metric but also a negative indicator of code quality (shorter code for the same thing is better), we now have people touting how many LOC their LLM agent is generating. It's like everyone forgot what LOC actually represents and what it means for long term maintenance costs.
No.
> Yes, his macro analysis correctly identifies a massive financial risk but
So, he's right.
> but his incessant pessimism
Realism. You already said he's right.
> the incredible ground-level utility that many of us on HN celebrate every day through undeniable, massive productivity gains.
No.
> At this point I'm trying to believe
Why are you trying to believe at all when things are so great? I thought you were busy celebrating all that undeniable, massive productivity gains.
The fact he’s never reflected on the glaring failures in his analysis tells what we need to know about his intellectual integrity. There’s truth in some of his words about financial risk, but if you can’t acknowledge that there’s upside too, you can’t evaluate risk properly either.
I find it difficult to take him seriously.
Do you think it's not slowing? Do I miss anything really important?
My understanding is that we have now is incremental improvement on thinking models which appeared more than a year ago. Of course, a breakthrough might happen, but I don't see one yet.
Which LLM company are you affiliated with/evangelizing for? Please disclose.
No, he's not, he's making tons of money every month from his Substack subscriptions. In fact, the AI bubble popping would be the worse thing ever for him, he would be out of a job.
Just like the who have predicated the US dollar will collapse any-moment-now and which pushed gold for decades.
Funny how people always say "oh, you are an AI lab, of course you are going to hype AI", but never "oh, you make sooo much money from predicting the collapse of the AI bubble..."
I do value having some naysayers in the mix generally, because we do need balanced critique in what is otherwise a very frothy hype cycle. I just don't think he's making sound arguments, and that's even assuming you even agree with his premises in the first place.
My biggest gripe with his napkin math is that he treats inference gross margins as something novel that you can't compare to normal SaaS margins. He's right in part: the constant carousel of R&D costs from model training, related infrastructure buildout, and other adjacent costs required to stay competitive do change the analysis a bit.
But he takes this way too far when he says this is structurally different from normal SaaS margins. The business model definitely doesn't look like Dropbox, but it absolutely looks a lot like AWS, especially early AWS, CDNs, telecom, etc. I can speak to the telecom bit personally, since it's been over half of my professional career as an engineer and, in this specific case, also as a founder. You can have a brutally capital-intensive infra business where profitability depends on utilization, oversubscription, peak-capacity planning, segmentation, and recovering capex over time.
The math he presents gets even more questionable as we see explicit segmentation happening for cost-saving reasons. Many forward-thinking orgs are waking up to the fact that they don't need to use the best, most expensive model for every task. They can route easier tasks to cheaper models, use caching, batch non-urgent workloads, and reserve frontier models for the subset of work that actually needs frontier intelligence. That directly undermines his claim that providers always need to chase frontier intelligence in order to maintain current demand, utilization, and pricing curves.
But from the article I linked back in March 2024:
"Generative AI models are expensive and compute-intensive without providing obvious, tangible mass-market use cases. Murati and Altman's futures depend heavily on keeping the world believing that development and improvement of their models' capabilities will continue a rapacious pace of progress that has unquestionably slowed, with OpenAI admitting that GPT-4 may be worse on some tasks.
As I've written before, hallucinations are a feature not a bug. These models do not "know" anything. They are mathematical behemoths generating a best guess based on training data and labeling, and thus do not "know" what you are asking it to do. You simply cannot fix them. Hallucinations are not going away."
Since then:
- hallucinations are dramatically less of a problem
- several mass market use cases have emerged, most notably coding
- rate of progress has increased
> I believe that artificial intelligence has three quarters to prove itself before the apocalypse comes, and when it does, it will be that much worse, savaging the revenues of the biggest companies in tech. Once usage drops, so will the remarkable amounts of revenue that have flowed into big tech, and so will acres of data centers sit unused, the cloud equivalent of the massive overhiring we saw in post-lockdown Silicon Valley.
We have seen 8 quarters since. Has any of that come to pass?
The business model does appear to be viable for these labs. But that viability comes because they aren't wasting a bunch of R&D money developing worthless products like AI video production.
Regarding your comment about the business model—the people in Silicon Valley are not stupid. They know the playbook; we've seen it with social networks. The issue isn't the business model itself; it's that these companies need to dominate the market, and the big players are competing for that on a global scale. It's the exact same playbook that played out in financial systems and social networks, and now it's happening with AI. Once these technologies are deeply integrated into enterprises and the global economy, these players will dominate the market for decades to come.
I can assure you, the people running those companies are smarter than you, me, and the author of this article."
If I were to make a prediction, it's that ultimately these cheaper models are going end up eating their lunch. I don't think they'll make back the money they've invested and once that reality hits investors, those two companies are sunk.
That, however, is not the end of AI. Nor will it be the end of Nvidia/micron/etc. It will more just be a localized bubble pop that doesn't eliminate the product from the market.
These models are building deep integrations into companies and the entire economy. Once that stabilizes, it will be like the electricity grid—pumping tokens to fuel decision-making across the entire global society. Good luck unplugging from that.
Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
Much like the electric grid, what we are seeing is a convergence on standard APIs. For example, most of these cheaper models are hosted using APIs compatible with OpenAI. It's not a matter of rewiring your electric plug to work with a different socket standard, instead it's just the process of plugging it into a new socket.
> Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
Certainly the Chinese models appear to be some of the best when it comes to competition, but they aren't the only ones. There are European models and other US based models which all run for cheaper.
The biggest competitors aren't small models, they are just the traditional players that already have an "in" with enterprises. That I think will start to show its face once this initial round of buildout is complete, which may not be for another 5+ years.
This is fire erasure
/s
This is often repeated but comes from ignorance mostly. You have * zero * reason to believe inference is costly other than just vibes. If you go by data and intuitions - the margins are high.
This kind of thinking really reinforces my belief that people have no idea and are using this whole [AI is not profitable and too costly] thing as a cathartic way to deal with immense progress.
ElFitz•1h ago
metadat•1h ago
Does the truth normally lie somewhere in the middle of it all?
kunai•1h ago
viccis•48m ago
Usually does when you decide what constitutes extreme.
techblueberry•1h ago
sumeno•1h ago
JesseTG•51m ago
d33d•46m ago
He's in the media business... its in his interest to amp things up.
JesseTG•35m ago
nyeah•36m ago
Of course that mentality is obsolete. Now we all have infinite access to perfectly correct information via the internet.
lowmagnet•23m ago
nyeah•39m ago
ElFitz•9m ago
I’m not attacking the piece. I’m not saying it’s right. I’m not saying it’s wrong.
What I’m saying is, the tone made it hard for me to judge the arguments fairly, despite finding some of them convincing. And as much as I dislike it, persuasion does partly depend on how an argument is made.