Ask HN: Who is using local LLMs in a production environment here?

6•Haeuserschlucht•4h ago
I'm asking because it seems that nobody really does that. Yes, there are some projects here and there, but ultimately everybody just jumps over to cloud LLMs. Everything is cloud. People pay for GPU usage somewhere in the middle of nowhere. But nobody really uses local LLMs long term. They say, "Well, it's so great. Local LLMs work on small devices; they even work on your mobile phone."

I have to say there's one exception for me, and that's Whisper. I actually do use Whisper a lot. But I just don't use local LLMs. They're just really, really bad compared to cloud LLMs.

And I don't know why, because to me it seems that a speech-to-text model should be much more challenging to create than a model that just generates text.

But it seems that they really cannot close the gap and still have the models run on consumer computers. And so I also go back to cloud LLMs, all privacy concerns aside.

Comments

websiteapi•36m ago
Things are changing too quickly for it to be worth it yet. Eventually LLMs won't really increase in capability or resource requirements anymore, and at that point, if the hardware itself isn't becoming more optimized for LLM workloads, you'd see people do this.
halJordan•6m ago
The federal government, especially the DoD, has adopted local LLMs. Now, they also have the big iron closed models "locally", so that stretches your definition, I'm sure. But they use other models too.
