AI is too expensive

https://www.wheresyoured.at/ai-is-too-expensive/

61•crescit_eundo•52m ago

Comments

jqpabc123•46m ago

It's just getting started. You won't find out what the real, actual cost is until after you build it into your workflow.

In other words; right now, we're still in the "bait" phase. The "switch" comes later.

happyPersonR•43m ago

Next year is gonna be when the “switch” comes lollll

ReptileMan•38m ago

Next year we will have new deepseek.

happyPersonR•13m ago

I see a couple of possibilities

1) someone deepseeks deepseek lol:

Generates their own weights and figures out a way to determine all of the intermediate states.

2) places realize there’s real risk with using a model that might have things baked into it that produce specific flaws that could be security bugs, but only under certain conditions.

gdulli•24m ago

And the switch will continue, it won't be a one-time event. Like how the price of Netflix etc. keeps going up periodically.

If people's dependence on their streaming service keeps them captive, just wait until people have gone 5 years without doing real work.

b65e8bee43c2ed0•42m ago

Chinese 1T+ models are being offered at a fraction of GPT/Claude cost, and the margin is healthy enough for dozens of providers to compete, so I find it highly likely that ClosedAI and Misanthropic sell tokens at massive markup. they just still bleed billions on their free tier and san francisco salaries.

citrin_ru•36m ago

Why we should assume that whoever offers these Chinese models makes sufficient profits and will not rise the prices eventually too?

han1•29m ago

Chinese models are state-funded and not concerned with taking profits.

larme•18m ago

Some of these models are open weight. You can try hosting them and do the price calculation yourself.

They also publish papers talking about how to save kv cache and computation powers. Because currently they don't have the most powerful nvidia cards, training and inference efficiency is very import for them.

simianwords•38m ago

If AI is too costly: bubble will burst because costs are unsustainable.

If AI is too cheap: bubble will burst because you can run them locally and data centrs are not needed.

If is it in-between, AI companies make too much money and they make too much profit which is bad!

I don't think this guy is a serious commentator.

harimau777•34m ago

I mean, those all seem like true statements.

sophacles•9m ago

This isn't an accurate summary, even at a 50Kft view. Are you a serious commentator?

sailfast•34m ago

These are all just bets that eventually someone wins anyway, right? Adoption is good but marginal revenue doesn’t matter if and when these models and solving world hunger - or have created the next yakuza mega corp that governs the world - right?

Feels like an unspoken rule here. Everyone wants to own a chunk of nuclear weapons and it doesn’t matter whether it’s profitable. You just need the nukes to survive and have a seat at the table

hypnodrones•32m ago

The rug pull on users is bound to happen and it will involve advertising.

add-sub-mul-div•20m ago

Algorithmically and seamlessly weaving undisclosed advertising (or other editorial content) into conversational output is their holy grail. It's the endgame. There's a reason they're pushing so hard.

Havoc•31m ago

It’s a similar bet as Uber. They also started out with numbers that make no sense - overpaying drivers and undercharging users

The math may look questionable but there are also senior people talking of automating all white color work in the next couple years. Even if that estimate is miles off on both time and % it’s still trillions. So crazy as the numbers seem it could still work out

empath75•16m ago

Cars and gas and human labor do not get cheaper over time the way that computer hardware does.

overgard•30m ago

More people need to read Ed, especially tech journalists. I feel like he's one of the rare few people that are actually speaking about the industry honestly.

bensyverson•30m ago

If you're older than 30, you've seen this play out before… This is just how the VC game works. Cross-town Uber rides don't stay $5 forever.

The bright side is: this is a golden era of subsidized tokens. It will not always be like this, so now is the time to churn out your passion projects.

Aeolun•26m ago

I think the good news is that we’re not at peak cheapness for tokens yet, but companies like deepseek show that it is perfectly possible.

BoredPositron•23m ago

Probably the worst comparison I have seen on the topic. gz.

TheSkyHasEyes•12m ago

Eventually you come to realize the more things change the more they stay the same.

edwin2•17m ago

Uber was an application of technologies sitting on top of many iterations of performance optimizations. (Think: the difference between 2009 internet speeds versus 2019 internet speeds. Or, 2009 smartphone specs versus 2019 smartphone specs.)

You could imagine a Moore’s Law-esque cheapening of the tech that coincides with the business raising their prices. That might look like a continuation of simply “using the tools” on the surface, but on the inside it would spell a gradual, meaningful increase in margin

Aurornis•15m ago

Uber is a good comparison because everyone was predicting the demise of ride sharing as soon as they tried to become profitable.

The subsidies went away gradually and the prices leveled out in a spot where the services are heavily used. Uber became profitable. Ride sharing is affordable.

I think our $20/month plans might become a little less generous and the $200/month plan won’t always allow non-stop vibe coding, but I don’t think the prices are going to rise so much that users are priced out. Like Uber, customers will grumble for a while and then adapt to the new normal.

The big difference is that compute hardware is getting better. I think we might overshoot with data center buildouts to the point that compute becomes cheap, while hardware improvements continue to lower the cost of serving models. Over time the same service becomes cheaper to operate, opposite of Uber where driver wages are creeping upward.

eieiewq•7m ago

No it’s a bad example as uber had solid unit economics. Uber is more Akin to Amazon.

bensyverson•6m ago

I would love to see the data to support the idea that Uber had solid unit economics during their expansion.

neals•14m ago

Token cost will come down in the future, it might even out

bensyverson•12m ago

It's hard to predict the future, but it seems likely that flagship tokens will become more expensive, but many people will use free tokens via local inference.

skybrian•7m ago

For Uber, paying the driver is unavoidable (for now) so this isn’t a good comparison at all.

A better comparison is with how much PC costs went down during the 80’s due to IBM clones and Moore’s law.

giancarlostoro•6m ago

I mean SubQ claims to reason as good as the frontier models, not better, but reasonably as good, and is 5x cheaper, so the future might not be all in on overpriced LLMs. Data science devs don't engineer for scalable performance, they engineer for the model's capabilities first and foremost.

SubQ was validated by at least one third party, not sure if we'll see more confirmation, but 5x cheaper costs is worth it. None of the frontier models care enough about cutting costs of their models, only being the best in benchmarks.

https://subq.ai/

dosinga•30m ago

The counter argument (not mine): Software Engineers are willing to spend their own money on AI. The same people that wouldn't pay 10 dollars for code if there was a workaround that took hours.

nubg•28m ago

Interesting perspective

simianwords•24m ago

I did a thought experiment: if you went back to 2019 and could use AI in your job at the current market price - like lets say using the latest Deepseek V4, would you pay for it?

Hell yeah obviously. There's close to no doubt. So why do we think its not true now?

graydsl•15m ago

Am I the only one that has AI or does everyone? If everyone has it what is the thought experiment? Why 2019?

changoplatanero•28m ago

Imagine you were looking at Google, a sustainable and profitable business, and you thought you saw a once in a lifetime opportunity to compete with them and take their position as a leading tech company. How much money would you need to spend to make a credible attempt?

Google has had decades to accumulate intellectual and physical capital. Catching up quickly means spending >500 billion. If you can actually dethrone Google (admittedly not an easy task) then it will have been worth it. If not, I suppose it's wasted investment.

Now what happens when three or four startups vie for this opportunity at once? Well that's how you get $2 trillion in captial investments per year.

jayd16•23m ago

What a strange train of thought. Why would you need that amount in this hypothetical? Why would dethroning them alone be worth it? It would literally only be worth it if you could do so profitably.

More realistically, it seems like someone calculated that it could still be profitable up to several hundreds of billions of dollars which explains the initial investment. And continued investment can be explained by trying to salvage the existing capital spend. But even if it's the best option those companies have now as far as a hypothetical goes, it still might not have been worth it.

doctorpangloss•18m ago

You: says someone has a strange train of thought, and then you also ask how can a company become profitable in a situation where it becomes a monopoly? Dude, the winner raises prices? "AI" is not expensive enough!

fred_is_fred•18m ago

Most disruptive start-ups don't come from a giant pile of cash, but from new ideas that the old players can't or won't adopt. It did not take $500B to build a digital camera.

giancarlostoro•8m ago

Meanwhile Meta / Mark Zuck is in crisis mode trying to come up with some meaningful way to break into AI, instead of competing in what's currently hot, they'll probably remake Elons weird persona system, which is not exactly the hottest thing on the planet, and billions down the drain that could have gone into building more efficient LLMs instead.

ethagnawl•6m ago

> billions down the drain that could have gone into building more efficient LLMs instead.

Or any one of thousands of other ventures which could be more beneficial to humanity, the environment, etc.

hoansdz•26m ago

The cost of tokens used by AI in many fields is even greater than the cost of human services; people are experiencing FOMO, but once the wave passes, the market will stabilize.

KoolKat23•25m ago

The timing of this is great considering Google's rumoured Gemini 3.5 flash pricing spike.

eloisant•21m ago

The thing that everyone seems to be missing is that the US AI companies are focused on the frontier models, that are very expensive for diminishing returns.

If suddenly the money craze stops, meaning (1) AI companies investors want them to become profitable and (2) clients start being cost-sensitive to AI bills (which they are absolutely not currently), then everyone will switch to smaller, cheaper models that are enough for a lot of use case.

Sonnet instead of Opus. GPT 5.4 instead of 5.5.

Chinese models.

People keep comparing to Uber but Uber can't suddenly make it cheaper to operate.

d0100•18m ago

> GPT 5.4 instead of 5.5.

I am exclusively using 5.4 because its only 1x and very good, but the github calculation showed my once $40 become a $680 billing

That is too expensive and not worth paying

czhu12•15m ago

I really respected Ed Zitron, but I feel like he's very much lost the plot on AI.

Scroll back not too far and he was publishing criticisms that no one wants to spend actual money AI. Anthropic has shattered all notions of that since then.

Then there was the idea that even if people want it, we have way too much GPU capacity to ever be saturated. Now almost all providers are hitting limits.

Now, its the next iteration that even if people want to spend money and GPU's are at capacity, its just never going to be profitable. This may or may not be true, especially with more capable open source models that can be served at cost. But at this point, he mostly just brings up anything possible to downplay AI

pingou•11m ago

The end goal of those companies is AGI, or even ASI. If you believe in that, and think they can do the job of a human for less money, you should put all your money into working towards that goal and buying as much compute as you can, especially since whoever gets there first (or is simply ahead and can use their AI to get even better) gets a big advantage.

skybrian•10m ago

I’m paying API prices for my hobby coding due to the coding agent I use. So far I’ve switched from Opus to Sonnet to GLM 5.1. Looks like it’s about 25% of the cost and quality seems good enough so far.

I think competition is going to keep customer costs low if you’re willing to switch. Maybe people on expense accounts won’t care, though?

tomxor•8m ago

FANG et al, sit on trillions in capital..

If they are instead burning this on "hyper-scaling", and the bubble ends, and they have overinvested in AI as this blog post asserts - then they never get a return. OK so far.

Question: In the aftermath "AI" will still have utility, just like after the dotcom bubble. So what would happen to all of the assets and services of these AI companies?. If the cost has already been sunk, would they continue to benefit society? and would owners be incentivised to continue their operation to minimise their losses? Would they be cannibalised for parts? Would they become unviable to even break even at existing operation cost?

I don't understand the economic factors enough to see the possible futures clearly. But my general point is - maybe it's ok if all that capital is spent in this way if it benefits society overall and FANG never see a return. But feel like I am missing other side-effects.

I've built a virtual museum with nearly every operating system you can think of

Apple unveils new accessibility features

I’ve joined Anthropic

Gaussian Splat of a Strawberry

Gentoo News: Copy Fail, Dirty Frag, and Fragnesia Kernel Vulnerabilities

Show HN: Superlog (YC P26) – Observability that installs itself and fixes bugs

OpenBSD 7.9

Hanoi's humble beer glass and the memory of a nation

CISA Admin Leaked AWS GovCloud Keys on GitHub

Intro to TLA+ for the LLM Era: Prompt Your Way to Victory

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

Why are most humans right-handed? The answer may lie in how we learned to walk

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

AI is too expensive

An Apple (II) for Teacher

Show HN: I made a 3D pose maker for artists

Peter Neumann has died

Polypad

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

Nim-Presto – REST API Framework for Nim Language

Cursor Introduces Composer 2.5

Click (2016)

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

RuView – See through walls with WiFi

U.S. Cybersecurity Agency Leaves Its Digital Keys Out in Public on GitHub

Colonization of Venus

Anthropic acquires Stainless

PyTorch Landscape

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

We let AIs run radio stations

I've built a virtual museum with nearly every operating system you can think of

Apple unveils new accessibility features

I’ve joined Anthropic

Gaussian Splat of a Strawberry

Gentoo News: Copy Fail, Dirty Frag, and Fragnesia Kernel Vulnerabilities

Show HN: Superlog (YC P26) – Observability that installs itself and fixes bugs

OpenBSD 7.9

Hanoi's humble beer glass and the memory of a nation

CISA Admin Leaked AWS GovCloud Keys on GitHub

Intro to TLA+ for the LLM Era: Prompt Your Way to Victory

I Found Ultra-Pure Quantum Crystals in an Abandoned Mine in the Atacama Desert

Why are most humans right-handed? The answer may lie in how we learned to walk

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

AI is too expensive

An Apple (II) for Teacher

Show HN: I made a 3D pose maker for artists

Peter Neumann has died

Polypad

Photo GIMP – A Patch for GIMP 3 for Photoshop Users

Nim-Presto – REST API Framework for Nim Language

Cursor Introduces Composer 2.5

Click (2016)

Kv4p HT – A homebrew 1W radio (VHF or UHF) that plugs into an Android phone

RuView – See through walls with WiFi

U.S. Cybersecurity Agency Leaves Its Digital Keys Out in Public on GitHub

Colonization of Venus

Anthropic acquires Stainless

PyTorch Landscape

1024000^2 Blocks, 2B2T Minecraft Server World Download Project, and Discoveries

We let AIs run radio stations

AI is too expensive

Comments