Ask HN: Could free/low cost LLMs be a momentary thing?

4•senda•12h ago

Say they(OpenAi Etc)don’t find a way to reduce the cost of running these LLMs. Will we shift towards slower/worse LLMs running locally? Or maybe enterprise ones only used by large corporations for specific tasks?

Will the era of using these to generate code end? Is the assume that the inference problem will be solved?

Comments

mullingitover•12h ago

I think it’s very unwise to bet against the advancement of technology.

> Will we shift towards slower/worse LLMs running locally?

Bet on faster/better LLMs running locally and invest/brace yourself for recession accordingly.

verdverm•8h ago

This, I've switched to qwen36moe on a spark and it's on par with gemini-3-flash. It's way better than I expected and in another ~6 months I expect things to be even better for open models of this size.

My long term expectation is that the big labs will build big models primarily for training and distilling to economically viable models. Even if the model capabilities don't plateau so soon, I think the economics of this will.

aurareturn•12h ago

Very clearly, all free LLM chatbots will need to be supported by ads. There is no other way to make them free in a large scale.

xnickb•12h ago

Government funding is another way. In some countries people pay some sort of "media" tax. That can be redistributed. A new one can be added. If LLMs are becoming a standard way to interact and process the data and using them is a social necessity then it is absolutely the job of the state to provide means for it.

aurareturn•12h ago

So basically minimum compute given to people like government health care?

That's what Sam Altman said we should do.

ben_w•11h ago

The rumour mill (justified, given cloud cost of running big open models with similar performance scores) says that these companies make money on inference, but lose it all on training.

So: when the money runs out and the bubble pops, we'll still get cheap existing models, what we lose is the race for new models.

We'd probably even keep free models: I forget where I saw it, but back in the early days someone noticed that models were so cheap that you could generate a decent sized blog post about any topic for about the same as the expected revenue from putting a few adverts on it and having it viewed *exactly once*.

That said, when (/if) these businesses stop chasing new models, it can make sense to burn the weights of the best at that date into a fixed (and analog, given how well they work with only a few bits of precision) circuit, making them more efficient. Not my field, so I'm not sure exactly how much more efficient analog can be; one or two orders of magnitude from what I've heard, but don't hold me to that, not my field.

downbad_•11h ago

Is it possible that at a point in the near future, when everyone is dependent on them, they can then remove all free tiers and pump up the prices?

But that would lead to another competition on prices again.

elnatro•11h ago

Yes. That’s my opinion. I think Apple will leverage their shared RAM and M architectures to sell their computers as local-LLM ready.

They for sure are testing LLMs and checking the performance of local models. Once they reach a performance and quality enough for some tasks they will announce Apple AI or some variation of the name.

All of this is speculation, but I think is obvious the right way.

punyaatloomavi•1h ago

Yes. I believe that. It’s just matter of time they start charging us quite a bit. And we won’t complain as we are so used to using by then

fp.

Ask HN: How to be SOC2 Type 2 compliant as a solo-entreprenuer?

Ask HN: Do you know what data your AI coding agent sends to the cloud?

Ask HN: Could free/low cost LLMs be a momentary thing?

Ask HN: When did computers stop being fun?

Tell HN: Mindie.dev is scraping emails from profiles to send spam

Tell HN: Dont use Claude Design, lost access to my projects after unsubscribing

SlothDB is an OLAP DB ahead of DuckDB on Clickbench SQL database in C++20

Ask HN: What LLM models are you using and why?

Viable open source Claude Design alternative?

Which country will be the first to pass laws limiting Meta Ray-Ban glasses?

Note-taking software,Novel ideas

Ask HN: What are you working on (non-AI)?

Ask HN: Do you still spend time maintaining Claude.md / AGENTS.md files?

Ask HN: Can I take Meta to court for banning business Insta or FB account?

Ask HN: What happened to the movie "Pirates of the Silicon Valley"

Ask HN: What happened to ssh-audit.com?

Load testing in your infra, not cloud

Ask HN: How to be SOC2 Type 2 compliant as a solo-entreprenuer?

Ask HN: Do you know what data your AI coding agent sends to the cloud?

Ask HN: Could free/low cost LLMs be a momentary thing?

Ask HN: When did computers stop being fun?

Tell HN: Mindie.dev is scraping emails from profiles to send spam

Tell HN: Dont use Claude Design, lost access to my projects after unsubscribing

SlothDB is an OLAP DB ahead of DuckDB on Clickbench SQL database in C++20

Ask HN: What LLM models are you using and why?

Viable open source Claude Design alternative?

Which country will be the first to pass laws limiting Meta Ray-Ban glasses?

Note-taking software,Novel ideas

Ask HN: What are you working on (non-AI)?

Ask HN: Do you still spend time maintaining Claude.md / AGENTS.md files?

Ask HN: Can I take Meta to court for banning business Insta or FB account?

Ask HN: What happened to the movie "Pirates of the Silicon Valley"

Ask HN: What happened to ssh-audit.com?

Load testing in your infra, not cloud

Ask HN: Could free/low cost LLMs be a momentary thing?

Comments