frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What happened to self-hosted models?

3•curiousaboutml•14h ago
Hi HN, sorry for using a burner account.

It seems to me that up until the beginning of the last year, we saw a couple of new "open" model release announcements almost every week. They'd set a new state of the art for what an enthusiast could run on their laptop or home server.

Meta, Deepseek, Mistral, Qwen, even Google etc. were publishing new models left and right. There were new formats, quantizations, inference engines etc. and most importantly - a lot of discourse and excitement around them.

Quietly and suddenly, this changed. After the release of gpt-oss (August 2025), the discourse has been heavily dominated around hosted models now. I don't think I've seen any mention of Ollama in any discussion that reached HN's front page in the last 6 months.

What gives? Is this a proxy signal that we've hit a barrier in LLM efficiency?

Comments

al_borland•14h ago
My wildly uneducated guess is that they are getting to the point where they need to figure out how to profit off all this investment, and releasing self-hosted open-source models isn’t going to help them do that.
curiousaboutml•14h ago
Possibly, but it's not just the release of new models. It seems the community itself has lost interested in self-hosted models.
nacozarina•14h ago
Investors need everyone to avoid self-hosted models and pay premium subscriptions for large centralized models, else they will never earn the profits they want. Self-hosted models spoil their revenue forecasts.
electroglyph•14h ago
there are tons of models released still. even some non-Qwen ones!
bityard•14h ago
HN only covers a very small slice of interesting things that happen in tech every day. If it's your only source of tech news and information, you are missing out on a LOT.

There are plenty of self-hosted models being released all the time, they just don't make it to HN. For that, you need to find a community that is passionate about testing and tinkering with self hosted models. A very popular one is "/r/localllama" on Reddit, but there are a few others scattered around.

doublerabbit•2h ago
Could you recommend other sites? I only use HN exclusively but would be keen on decent tech new sites without having to sieve through the sludge of Google.

TheRegister, SlashDot and hackaday I know of.

gnosis67•13h ago
Ollama has changed. Early versions were raw, and then they were optimized (I’m on a laptop with 64GB RAM), and then they fell to shit. Optimized for someone else’s home rig I suppose.

And my old favorite models broke so I have to link different versions. nous-hermes2-mixtral I miss your sage banter.

Now everything runs on an excessive lag.

softwaredoug•12h ago
One thing that happened was the providers got better at hosting smaller and cheaper models. So you could self host or just get your work done with GPT 5 nano.
potsandpans•10h ago
They're still going. I just bought a 5090 for myself this Christmas to do more interesting things.

I mostly use them for game assets.

Trellis2 is very cool. Ive managed to put together a sdxl -> trellis -> unirig pipeline to generate 3d characters with mixamo skeletons that's working pretty well.

On the llm front, deepseek and qwen are still cranking away. Qwen3 a22b instruct, imho does a better job than gemini in some cases with ocr and translation of handwritten documents.

The problem with these frontier open weight models is that running them locally is not exactly tenable. You either have to get a cloud GPU instance, or go through a provider.

- https://github.com/microsoft/TRELLIS.2 - https://github.com/VAST-AI-Research/UniRig

jaggs•9h ago
There are a lot of local models being released every week. You really need to log into /r/localllama to stay up to date.

Max Payne – two decades later – Graphics Critique

https://darkcephas.blogspot.com/2021/07/max-payne-two-decades-later-graphics.html
1•davikr•5m ago•0 comments

Show HN: Just published a hard-SF novel Voyager1 returns with a quantum palantir

https://www.amazon.com/dp/B0GFSMP572
1•dufbugderopa•10m ago•1 comments

Orca: A New Architecture for Efficient AGI Through Parent-Teacher Learning

https://x.com/EricOmnigenius/article/2009656779945451932
2•ericspecullaas•12m ago•0 comments

A curated list of awesome explorable explanations

https://github.com/blob42/awesome-explorables
2•vitalnodo•15m ago•0 comments

The Declining Value of Personal Advice

https://www.gojiberries.io/the-declining-value-of-interpersonal-advice/
1•neehao•15m ago•0 comments

Show HN: Artdots: The benefits of creating a side project

https://artdots.co/blog/artdots-the-benefits-of-creating-a-side-project
1•veliona•19m ago•0 comments

Nvidia Announces Alpamayo Open-Source AI Models to Accelerate Reasoning-Based AV

https://nvidianews.nvidia.com/news/alpamayo-autonomous-vehicle-development
2•lateforwork•22m ago•0 comments

Ask HN: Before codebase review, replace all vars containing simple with complex?

1•gitprolinux•22m ago•0 comments

Show HN: Umaro – An interactive music theory suite for guitarists

https://www.umaro.app/
1•SnowingXIV•23m ago•0 comments

Zluda run unmodified CUDA on non Nvidia hw

https://www.phoronix.com/news/ZLUDA-CUDA-13.1-Compatibility
2•gigatexal•28m ago•0 comments

"About a decade ago... I developed an automated theorem-proving framework"

https://twitter.com/getjonwithit/status/2009602836997505255
2•Ariarule•28m ago•0 comments

Tool for live presentations using manim

https://github.com/jeertmans/manim-slides
1•vitalnodo•32m ago•0 comments

Workers at Redmond SpaceX lab exposed to toxic chemicals

https://www.fox13seattle.com/video/fmc-w1ga4pk97gxq0hj5
6•SilverElfin•34m ago•0 comments

Ask HN: When has a "dumb" solution beaten a sophisticated one for you?

3•amadeuswoo•46m ago•2 comments

Why some clothes shrink in the wash – and how to 'unshrink' them

https://www.swinburne.edu.au/news/2025/08/why-some-clothes-shrink-in-the-wash-and-how-to-unshrink...
1•OptionOfT•50m ago•0 comments

Show HN: VAM Seek – 2D video navigation grid, 15KB, zero server load

https://github.com/unhaya/vam-seek
5•haasiy•51m ago•0 comments

A curated list of free courses with certifications

https://github.com/cloudcommunity/Free-Certifications
3•javatuts•56m ago•0 comments

Bruno – local and Git-native solution to accelerate and secure API

https://www.usebruno.com/
1•javatuts•57m ago•0 comments

OpenAI is reportedly asking contractors to upload real work from past jobs

https://techcrunch.com/2026/01/10/openai-is-reportedly-asking-contractors-to-upload-real-work-fro...
13•pseudolus•1h ago•0 comments

Datadog, thank you for blocking us

https://www.deductive.ai/blogs/datadog-thank-you-for-blocking-us
34•gpi•1h ago•1 comments

Google moonshot spinout SandboxAQ claims an ex-exec is attempting 'extortion'

https://techcrunch.com/2026/01/09/google-moonshot-spinout-sandboxaq-claims-an-ex-exec-is-attempti...
1•Geekette•1h ago•0 comments

The new vs. used car debate is dead. They're both expensive debt traps

https://washingtonpost.com/business/2026/01/10/1000-payments-car-debt-trap/
6•pseudolus•1h ago•1 comments

Show HN: Reverse-engineering images into model-specific syntax(MJ,Nano,Flux,SD)

https://promptslab.app/image-to-prompt
1•jackzhuo•1h ago•1 comments

Npmgraph – a web-based tool that visualizes NPM package dependencies

https://npmgraph.js.org/
2•javatuts•1h ago•0 comments

Show HN: Hashing Go Functions Using SSA and Scalar Evolution

https://github.com/BlackVectorOps/semantic_firewall
2•BlackVectorOps•1h ago•1 comments

Show HN: mister.jar – Modular MRJAR Files Made Easy

http://lingocoder.com/mrjar/mrjar.usage.html
1•burnerToBetOut•1h ago•0 comments

Culture Isn't Stagnating, You Guys Are Just Old

https://www.jenn.site/culture-isnt-stagnating-you-guys-are-just-old/
7•Analemma_•1h ago•4 comments

Show HN: I made an Android app which sends Health Connect data to your webhooks

https://github.com/mcnaveen/health-connect-webhook
1•mcnx097•1h ago•0 comments

AI is intensifying a 'collapse' of trust online, experts say

https://www.nbcnews.com/tech/tech-news/experts-warn-collapse-trust-online-ai-deepfakes-venezuela-...
7•pseudolus•1h ago•2 comments

Can Walking Be My Whole Workout?

https://www.nytimes.com/2026/01/06/well/move/is-walking-enough-exercise.html
3•thelastgallon•1h ago•1 comments