The author is correct that intelligence is compounding. That's why domain-specific models are usually general models converted to domain-specific ones by continued pretraining. Even general models, like H2O's, have been improved by constraining a second phase of pretraining to domain-supporting general knowledge. But they end up domain-specific eventually.
Outside LLMs, I think most models are domain-specific: genetics, stock prices, ECG/EKG scans, transmission shifting, seismic, climate, etc. LLMs trying to do everything are an exception to the rule that most ML is domain-specific.
This looks like an "ethical" LLM but not domain specific. What is the domain here?
> That's why domain-specific models are usually general models converted to domain-specific models by continued pretraining
I've also wondered about this, e.g. in the case of the Codex models. My hunch is that a good general model plus an appropriate system prompt trumps a domain-pretrained model, which is why even OpenAI sort of recommends using GPT-5.4 over any Codex model.
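The "system prompt instead of continued pretraining" approach above can be sketched as follows. This is a minimal illustration of the idea, not any particular vendor's API; the prompt text and helper name are made up for the example.

```python
# Sketch: specializing a general model with a system prompt instead of
# continued pretraining. The domain instructions steer the model at
# inference time; the weights stay generic.

def build_messages(domain_instructions: str, user_query: str) -> list[dict]:
    """Prepend a domain-specific system prompt to an otherwise generic chat."""
    return [
        {"role": "system", "content": domain_instructions},
        {"role": "user", "content": user_query},
    ]

coding_prompt = (
    "You are a senior software engineer. Answer with idiomatic, "
    "well-tested code and briefly explain trade-offs."
)
messages = build_messages(coding_prompt, "Reverse a linked list in C.")
```

The same general model becomes a "coding model" or a "legal model" purely by swapping `domain_instructions`.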
>We would have a healthcare model, economics model, mathematics model, coding model and so on.
It's not a question of whether there will ever be specialized models, but a matter of when.
This will democratize almost all work and professions, including those of programmers, architects, lawyers, engineers, medical doctors, etc.
Glass-half-empty people will call this a catastrophe of machines replacing humans. Glass-half-full people, on the other hand, will say it's good for society and humanity, making the work more efficient, faster, and much cheaper.
Imagine: instead of waiting a few months for your CVD diagnostic procedure because of the worldwide shortage of cardiologists (a fact), AI/LLM-assisted diagnosis will probably take only a few days, with an expert cardiologist in the loop, provided the sensitivity is high enough.
It's a win-win for patients, medical doctors, and hospitals. This will lead to earlier detection of CVD, and hence fewer complications and less suffering, whether acute or chronic.
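"Provided the sensitivity is high enough" refers to a standard screening metric: the fraction of actual positives the test catches. As a quick reference (the counts below are illustrative, not from any study):

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Sensitivity (recall): TP / (TP + FN).
    For a CVD screening tool, a false negative is a missed case,
    which is why this is the metric that has to be high."""
    return true_positives / (true_positives + false_negatives)

# Illustrative numbers only: 95 cases detected, 5 missed.
sensitivity(95, 5)  # 0.95
```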
Foundation models are generic by nature, trained on HPC clusters with GPUs/TPUs inside AI data centers.
The other extreme is RAG, with vector databases and file systems for context prompting, as the sibling comments mention.
The best trade-off, the Goldilocks option, is model fine-tuning. Specifically, the promising self-distillation fine-tuning (SDFT) recently proposed by MIT and ETH Zurich [1],[2]. Unlike conventional supervised fine-tuning (SFT), which is prone to forgetting, SDFT is not forgetful, which makes fine-tuning practical rather than wasteful. SDFT needed only 4 x H200 GPUs for the fine-tuning process.
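The general mechanism behind distillation-based fine-tuning is a loss term that keeps the student's token distribution close to a frozen teacher's, which is what counteracts forgetting. The toy computation below illustrates that KL-divergence penalty in isolation; it is a schematic of the general idea, not the SDFT algorithm from [1].

```python
# Toy illustration of a distillation penalty: the fine-tuned student is
# discouraged from drifting away from the frozen teacher's distribution.
import math

def kl_divergence(teacher: list[float], student: list[float]) -> float:
    """KL(teacher || student) over one token's probability distribution."""
    return sum(t * math.log(t / s) for t, s in zip(teacher, student) if t > 0)

teacher_probs = [0.7, 0.2, 0.1]   # frozen copy of the model before tuning
student_probs = [0.6, 0.3, 0.1]   # model being fine-tuned
penalty = kl_divergence(teacher_probs, student_probs)  # small when close
```

In training, a term like `penalty` is added to the task loss, so the model learns the new domain while staying anchored to its general knowledge.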
Apple is reporting the same with their simple self-distillation (SSD) for LLM coding specialization [3],[4]. They used 8 x B200 GPUs for model fine-tuning, which any company can afford for local fine-tuning of the open-weight LLMs available from Google, Meta, Nvidia, OpenAI, DeepSeek, etc.
[1] Self-Distillation Enables Continual Learning:
https://arxiv.org/abs/2601.19897
[2] Self-Distillation Enables Continual Learning:
https://self-distillation.github.io/SDFT.html
[3] Embarrassingly simple self-distillation improves code generation:
https://arxiv.org/abs/2604.01193
[4] Embarrassingly simple self-distillation improves code generation (185 comments):
scrpgil•3h ago
I built an MCP server that feeds a user's real schedule, tasks, and goals into Claude/ChatGPT. The model isn't specialized — but the output is, because the context is. No fine-tuning, no domain-specific training. Just structured data at inference time.
Domain-specific LLMs won't exist, not because specialization is useless, but because it's cheaper to specialize the input than the model.
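The "specialize the input, not the model" pattern described above can be sketched as serializing live user state into the context at inference time. The field names and shapes below are hypothetical; in the setup the commenter describes, an MCP server would supply this data to the chat client.

```python
# Sketch: a generic model made to produce specialized output purely by
# structuring the context with the user's real data.
import json

def build_context(schedule: list[dict], goals: list[str]) -> str:
    """Serialize user schedule and goals into a prompt preamble."""
    return (
        "User schedule:\n" + json.dumps(schedule, indent=2) +
        "\nUser goals:\n" + "\n".join(f"- {g}" for g in goals)
    )

ctx = build_context(
    [{"time": "09:00", "event": "standup"}],
    ["ship the MCP server", "run 5k"],
)
```

No weights change anywhere in this pipeline; the specialization lives entirely in `ctx`.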