frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Why LLMs confidently hallucinate instead of admitting knowledge cutoff?

2•cryptography•1h ago
I asked Claude about a library released in March 2025 (after its January cutoff). Instead of saying smth like "I don't know, that's after my cutoff," it fabricated a detailed technical explanation - architecture, API design, use cases. Completely made up, but internally consistent and plausible.

What's confusing: the model clearly "knows" its cutoff date when asked directly, and can express uncertainty in other contexts. Yet it chooses to hallucinate instead of admitting ignorance.

Is this a fundamental architecture limitation, or just a training objective problem? Generating a coherent fake explanation seems more expensive than "I don't have that information."

Why haven't labs prioritized fixing this? Adding web search mostly solves it, which suggests it's not architecturally impossible to know when to defer.

Has anyone seen research or experiments that improve this behavior? Curious if this is a known hard problem or more about deployment priorities.

Comments

bigyabai•1h ago
> Yet it chooses to hallucinate instead of admitting ignorance.

LLMs don't "choose" to do anything. They inference weights. Text is an extremely limiting medium, and doesn't afford LLMs the distinction between fiction and reality.

barrister•49m ago
If I ask Grok about anything that occurred this morning, he immediately starts reading and researching in real time. "Summarize what Leavitt said this morning." "Tell me what's new in python 3.14." Etc.. What do you mean by "cutoff", it seems unlikely that Claude is that limited.

Are AI Builders the Final Abstraction Layer?

https://fortune.com/2025/09/29/anthropic-releases-claude-sonnet-4-5-a-model-it-says-can-build-sof...
1•rkwap•2m ago•1 comments

Big AI firms pump money into world models as LLM advances slow

https://arstechnica.com/ai/2025/09/big-ai-firms-pump-money-into-world-models-as-llm-advances-slow/
1•merksittich•4m ago•0 comments

AI background removal can now also clean the subject of reflected colours

https://imgprocessor.net/
1•cristiancc•5m ago•1 comments

Instant Checkout in ChatGPT

https://stripe.com/newsroom/news/stripe-openai-instant-checkout
1•jmtulloss•5m ago•0 comments

Constant-Depth NTT for FHE-Based Private Proof Delegation

https://pse.dev/blog/const-depth-ntt-for-fhe-based-ppd
1•badcryptobitch•6m ago•0 comments

Sonnet 4.5 ranks #25 (below other Claude models) in generating SQL

https://www.tinybird.co/ai
1•enether•7m ago•1 comments

Zipoc: Git, but super lightweight and simpler and with more features (WIP)

https://github.com/jimmydin7/zipoc
1•jimmydin7•9m ago•0 comments

Claude Sonnet 4.5 autonomously generates Slack clone in one shot in 30 hours

https://www.theverge.com/ai-artificial-intelligence/787524/anthropic-releases-claude-sonnet-4-5-i...
1•yodon•9m ago•3 comments

Show HN: Resrap – A Parser but in Reverse

https://resrap.osdc.dev/
3•itsarnavsh•10m ago•2 comments

Diagnosing a Linux Performance Regression

https://automattic.com/2024/03/14/systems-report-linux-performance-regression/
2•program•14m ago•0 comments

Is Mainstream Tech News Dead?

https://rumble.com/v6zjkny-is-mainstream-tech-news-dead.html
1•mikece•15m ago•0 comments

What if there's an overinvestment in AI?

https://www.mbi-deepdives.com/what-if-theres-an-overinvestment-in-ai/
2•akyuu•17m ago•0 comments

A blue jay and a green jay mated. Their offspring is a scientific marvel

https://www.cnn.com/2025/09/29/science/blue-jay-green-jay-hybrid
3•Kaibeezy•20m ago•0 comments

Gemini API Down

https://twitter.com/OfficialLoganK/status/1972729571868086327
2•ekojs•24m ago•0 comments

NZ universities give up using software to detect AI in students' work

https://www.rnz.co.nz/news/national/574517/universities-give-up-using-software-to-detect-ai-in-st...
4•billybuckwheat•26m ago•0 comments

Vibe Working: Introducing Agent Mode and Office Agent in Microsoft 365 Copilot

https://www.microsoft.com/en-us/microsoft-365/blog/2025/09/29/vibe-working-introducing-agent-mode...
6•prossercj•27m ago•1 comments

Woman admits UK Bitcoin fraud charges after ' largest' crypto seizure

https://www.theguardian.com/uk-news/2025/sep/29/zhimin-qian-admits-uk-bitcoin-charges-after-world...
2•mindracer•28m ago•0 comments

Claude Plays Catan [video]

https://www.youtube.com/watch?v=BER3EhUIyz0
2•GabrielBianconi•29m ago•0 comments

Diffusion Cam: img2text2img social media

https://www.diffusion.cam/
2•arjvik•30m ago•0 comments

Researchers find tree growth boosts insect herbivory

https://phys.org/news/2025-09-tree-growth-boosts-insect-herbivory.html
2•PaulHoule•30m ago•0 comments

100X Faster: How We Supercharged Netflix Maestro's Workflow Engine

https://netflixtechblog.com/100x-faster-how-we-supercharged-netflix-maestros-workflow-engine-028e...
2•emschwartz•31m ago•0 comments

How to Use an AWS S3 Bucket as a Pulumi State Back End

https://nelson.cloud/how-to-use-an-aws-s3-bucket-as-a-pulumi-state-backend/
2•speckx•32m ago•0 comments

Tile Language: DSL for High-Performance GPU/CPU/Accelerators Kernels

https://github.com/tile-ai/tilelang
3•lukax•34m ago•1 comments

Small Near-Earth Objects in the Taurid Resonant Swarm

https://arxiv.org/abs/2509.22602
2•bikenaga•35m ago•0 comments

Show HN: Open-Source Configurable AI Agents for Company Research

https://github.com/DimiMikadze/Mira
2•DimitriMikadze•36m ago•0 comments

Agentic-Commerce-Protocol

https://github.com/agentic-commerce-protocol/agentic-commerce-protocol
3•vettyvignesh•37m ago•0 comments

Help Me Find Missing Issues of Australian Personal Computer

https://blog.decryption.net.au/posts/apc-callout.html
1•naves•38m ago•0 comments

LoongArch Reference Manual

https://loongson.github.io/LoongArch-Documentation/LoongArch-Vol1-EN.html
1•welovebunnies•40m ago•0 comments

The new light of Jony Ive's life

https://www.wallpaper.com/design-interiors/lighting/jony-ive-lovefrom-balmuda-sailing-lantern
3•Nrbelex•41m ago•2 comments

US to See $350B Nuclear Boom to Power AI

https://www.bloomberg.com/news/articles/2025-09-29/us-to-see-350-billion-nuclear-boom-to-power-ai...
1•aanet•42m ago•4 comments