frontpage.

Hallucination Is Inevitable: An Innate Limitation of Large Language Models

https://arxiv.org/abs/2401.11817
11•drob518•1h ago

Comments

bell-cot•1h ago
IANAL, nor an expert in this space.

But might any such person care to comment on the consequences if this "it is impossible, even in theory, to eliminate LLM hallucinations" result holds up?

hilariously•37m ago
They are lossy statistical prediction machines; eliminating hallucinations effectively means eliminating the lossy part, at which point you might as well just use predicates in a database of facts.
Jiro•45m ago
From that abstract it doesn't sound like they allowed for the possibility that the LLM could be trained to say "I don't know" for some things.
dwallin•27m ago
Yeah, they only “proved” hallucination is inevitable by defining it as any case where the LLM doesn’t provide the “correct” answer. By this definition, an LLM deciding not to answer is also a “hallucination”.
jaclebus81•19m ago
My intuition on this is like training a classifier on four classes: dog, cat, cow, and IDK. It feels intuitive to us but is really hard to do in practice. In the classifier case, we leverage a subset of data to train the model to give correct answers on unseen data. If we want the model to generalize to unseen data, we need it to call unseen dog-like things a dog; if not, then all unseen dogs would be IDK. Learning that boundary of "known vs. unknown" is very hard. Done poorly, you get a model that cannot abstract to anything not in the dataset, which is a huge part of what makes these models so impressive. I'm sure there is more to it than this, but it does not surprise me that it is an unsolved problem.
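In practice the IDK class described above is often approximated not by training a fourth class at all, but by thresholding the classifier's confidence and abstaining below it. A minimal sketch (the class names, logits, and threshold are illustrative, not from the paper):

```python
# Minimal sketch: treat "IDK" as a confidence threshold on softmax outputs.
import math

CLASSES = ["dog", "cat", "cow"]

def softmax(logits):
    # Numerically stable softmax over raw class scores.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def predict(logits, threshold=0.7):
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    # If the model is not confident enough, abstain instead of guessing.
    return CLASSES[best] if probs[best] >= threshold else "IDK"

print(predict([4.0, 0.5, 0.2]))  # one class dominates: commit to it
print(predict([1.0, 0.9, 0.8]))  # near-uniform scores: abstain with "IDK"
```

This sidesteps having to learn the "known vs. unknown" boundary from data, which is exactly the hard part the comment points at; the threshold is a crude stand-in for it.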
thomastjeffery•39m ago
Describing it as a limitation is the problem. Hallucination is the core feature. It's the only thing they do!
whythismatters•37m ago
>Submitted on 22 Jan 2024 (v1), last revised 13 Feb 2025 (this version, v2)
stevefan1999•35m ago
LLMs, or transformers, merely extract signals from human text and build a "contextualized" predictor over a long sequence of words, weighted by the information (technically, the attention) of each token, then generate sentences that way, one token at a time.

But the biggest problem is that even humans themselves are subject to hallucination; that is called being delusional, or being drugged. So it is inevitable from first principles.
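The "statistical predictor" picture can be caricatured with a toy bigram model: predict the next word purely from co-occurrence counts. Real transformers condition on the whole context via attention rather than just the previous word; the corpus and code here are illustrative only:

```python
# Toy bigram "language model": predict the next word from counts alone.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat the cat ate the fish".split()

# Count which word follows which in the training text.
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def next_word(word):
    # Greedy decoding: take the most frequent continuation.
    # This is a (very lossy) statistical prediction, not a lookup of facts.
    return follows[word].most_common(1)[0][0]

print(next_word("the"))  # "cat" follows "the" most often in this corpus
```

The model happily emits fluent continuations with no notion of truth, which is the point both this comment and the parent thread are making.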

prewett•13m ago
Humans hallucinate, in the LLM sense, all the time. Did that sign really say that? Nope, I just extrapolated from the first three letters. In the Cambrian Explosion article on HN this morning, I thought the first line said that the earth was desolate. The second line didn't match up with that idea, so I read it again, and the first line said the opposite of what I thought. I particularly hallucinate things into emails from people at work that I disagree with, so much so that I've learned to wait until the next day to reply, and usually I find that they didn't say what I thought they said.
red75prime•29m ago
They prove that no finite amount of training data is enough to extrapolate an adversarially constructed non-continuous function. It's something akin to the no free lunch theorem (NFL).

No one uses the NFL to "prove" that LLMs can't learn to be the best optimizers, because it also proves that people can't be the best optimizers, but we manage somehow, so the theorem is irrelevant.

This is a fallacy of proving too much.
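The adversarial-function argument can be made concrete: for any learner fixed after seeing a finite training set, an adversary can define a target function that agrees on every training point yet disagrees everywhere else. A toy illustration (not the paper's actual construction):

```python
# For any hypothesis fit to finitely many points, an adversary can build a
# target function that matches the training data but differs off it.
train = {0: 0, 1: 1, 2: 4}          # finite training set: x -> x**2
hypothesis = lambda x: x ** 2        # the learner generalizes to x**2

def adversary(x):
    # Agree wherever the learner was trained, disagree everywhere else.
    return train[x] if x in train else hypothesis(x) + 1

# Perfect agreement on the training set...
assert all(adversary(x) == hypothesis(x) for x in train)
# ...yet guaranteed wrong on every unseen input.
print(adversary(3), hypothesis(3))  # 10 9
```

Since the construction works against any fixed learner, human or machine, it proves too much, which is the comment's objection to using it against LLMs specifically.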

Blueprint Bench: First signs of 3D spatial intelligence in LLMs

https://andonlabs.com/evals/blueprint-bench-2
1•lukaspetersson•34s ago•0 comments

Verifying Poseidon in Clean: Why the Last 'Sorry' Is About Primality

https://blog.zksecurity.xyz/posts/poseidon-clean/
1•martocho•3m ago•0 comments

The Record of a Sonnet Drift

https://twitter.com/diandianhsutw/status/2051302622708318212
1•WLHsu•4m ago•1 comments

Stop big tech from making users behave in ways they don't want to

https://economist.com/by-invitation/2026/04/29/stop-big-tech-from-making-users-behave-in-ways-the...
1•andsoitis•4m ago•0 comments

Oomphalism

https://joeldueck.com/oomphalism.html
1•velcrovan•4m ago•0 comments

Uutils Coreutils CVEs

https://seclists.org/oss-sec/2026/q2/332
1•_____k•5m ago•0 comments

Load balancing usage across Codex accounts

https://pepsipu.com/blog/2026-04-agent-scheduling/
1•pepsipu•6m ago•0 comments

1Mbet

https://1millionbet.com/
1•sergnowaday•8m ago•0 comments

Drop a Pin, Get a Link

https://addypin.com/
1•avoidaccess•9m ago•0 comments

Ask HN: Why is sharing private static HTML with non-engineers still hard?

1•nate•10m ago•0 comments

How Russia Is Luring Africans to Ukraine

https://www.nytimes.com/2026/05/04/world/africa/ukraine-russia-war-african-soldiers.html
3•loandbehold•11m ago•0 comments

Making Fuel from Thin Air: The Magical Methane Machine

https://www.corememory.com/p/the-magical-methane-machine-casey-handmer-terraform
1•metadat•12m ago•0 comments

Future of Work with AI Agents

https://futureofwork.saltlab.stanford.edu/
1•iceboundrock•12m ago•0 comments

Young Men Are Going to Extremes to Feel Like They Measure Up

https://www.wsj.com/health/wellness/young-men-are-going-to-extremes-to-feel-like-they-measure-up-...
1•Cider9986•13m ago•0 comments

My new hobby: Asking LLMs to generate ASCII Hamsters

https://internetexception.com/2026/05/04/my-new-hobby-asking-llms-to-generate-ascii-hamsters/
1•npodbielski•13m ago•0 comments

Tailoring AI solutions for health care needs

https://www.technologyreview.com/2026/05/04/1134425/tailoring-ai-solutions-for-health-care-needs/
1•joozio•13m ago•0 comments

macOS port of Notepad++ called out for trademark violation

https://www.theregister.com/2026/05/04/notepad_dev_demands_unofficial_macos/
1•speckx•14m ago•0 comments

Tesla reaches 10B FSD miles – is there a magical milestone for autonomy?

https://electrek.co/2026/05/03/tesla-fsd-10-billion-miles-no-magical-milestone-autonomy/
1•Brajeshwar•14m ago•0 comments

The Visible Zorker: Zork 3

https://eblong.com/infocom/visi/zork3/
1•zarlez•15m ago•0 comments

Show HN: Retrodex – Retro game collection tracker and game encyclopedia

https://retrodex.games
2•addamh•19m ago•1 comments

What is the whole point of writing

https://rebeccatoh.pika.page/posts/2026-04-30-what-is-the-whole
1•speckx•19m ago•1 comments

Show HN: Writer – fast, lightweight and open source markdown editor

https://writer.computer
1•nirvsoner•21m ago•0 comments

UAE says it's under attack from Iranian missiles and drones despite ceasefire

https://www.cnbc.com/2026/05/04/iran-war-uae-trump-ceasefire-missiles.html
3•logicchains•23m ago•0 comments

Load Testing for SFTP, FTP, and FTPS

https://github.com/roshandubey-cloud/utilities/tree/main/sftp-loadtest
1•rdship•24m ago•0 comments

VR Coding for the AI Coding Era – Monitoring 5 AI Agents at Once

https://typia.io/blog/vr-coding-in-ai-coding-era/
1•autobe•25m ago•0 comments

PGKeeper: Figma's Postgres connection pooler Renaissance era

https://www.figma.com/blog/pgkeeper-building-the-bouncer-we-needed-for-postgres/
7•pinser98•25m ago•0 comments

Ctify_: A lightweight, PHP-based wiki, forked from PmWiki

https://github.com/altilunium/ctify_
1•altilunium•26m ago•0 comments

Wikimedia Foundation closes Wikinews after 21 years

https://en.wikinews.org/wiki/Wikimedia_Foundation_closes_Wikinews_after_21_years
5•benwills•27m ago•1 comments

Robot dogs with tech boss faces roam Berlin art exhibit

https://www.youtube.com/watch?v=909UTYDtuGY
5•otikik•28m ago•1 comments

KeePassχ – A KeePassXC Fork

https://codeberg.org/keepasschi
1•birdculture•29m ago•0 comments