frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Why are LLM's made intentionally non-deterministic?

5•mojuba•1d ago
Just found out that the main factor that introduces non-determinism in LLM's answers is the so called temperature that adds variation ("creativity") into selecting the winning token each time.

This might be good for creative tasks, but may as well become an obstacle in adopting LLM's for certain other tasks. Among other things, it makes LLM-based systems untestable.

Program code is deterministic for a reason (if you forget about physical defects and cosmic particles occasionally hitting microchips and flipping some bits). If your code is supposed to control a nuclear power plant you better have a testable system with proven correctness.

So why aren't the users of LLM's given some sort of a fader to control temperature with an option to have 100% determinism? Who decided that we shouldn't have control over this parameter and why?

Comments

subject4056•1d ago
Among other reasons, if you turn temperature down to 0, llms stop working. Like they don't give natural language answers to natural language questions any more, they just halt immediately. Temperature gives the model wiggle room to emit something plausible sounding rather than clam up when presented an input that wasn't verbatim in the training data (such as the system prompt).
mojuba•1d ago
Yes but that doesn't explain why we aren't given a choice. Program code is boringly deterministic but in many cases it's exactly what you need while non-determinism becomes your dangerous enemy (like in the case of some Airbus jets being susceptible to bit flips under cosmic rays)
dtagames•1d ago
The current way to address this is through RAG applications or Retrieval Augmented Generation. This means using the LLM side for the natural language non-deterministic portion and using traditional code and databases and files for the deterministic part.

A good example is bank software where you can ask what your balance is and get back the real number. A RAG app won't "make up" your balance or even consult the training the find it. Instead, the traditional code (deterministic) operations are done separately from the LLM calls.

Cr8•1d ago
unfortunately disabling temperature / switching to greedy sampling doesn't necessarily make most LLM inference engines _fully_ deterministic as parallelism and batching can result in floating point error accumulating differently from run to run - it's possible to make them deterministic but does come with a perf hit

some providers _do_ let you set the temperature, including to "zero", but most will not take the perf hit to offer true determinism

dtagames•1d ago
In fact, nondeterminism goes all the way down to the training level where every token and weight starts as a random number, a concept I wrote about in this intro to how AI works article [0].

The reason for this is that randomness is crucial to getting emergent data out of a system. Those are unexpected, unpredictable, but often useful results. This is how an LLM can answer a question it's never been asked before.

We have had deterministic databases forever so there would be no AI advance if LLMs were just databases. The AI models of the 1960's tried that very approach, called rules based and it doesn't work. We can never come up with or write down all the rules. The failure of those methods lead to the "AI winter" and no further real progress in AI until the invention of transformers at Google.

[0] https://levelup.gitconnected.com/something-from-nothing-d755...

Show HN: A 134-Modality AGI Core Built in Python (Velocity_Nova_Prime)

1•AISovereignDev•41s ago•0 comments

US seizes Russian-flagged oil tanker in Atlantic after two-week pursuit

https://www.theguardian.com/world/2026/jan/07/russia-submarine-escort-shadow-fleet-tanker-us-sanc...
1•sorokod•1m ago•0 comments

RFK Jr announces nutrition guidelines, with more protein and less processed food

https://www.cnbc.com/2026/01/07/rfk-jr-nutrition-guidelines-protein-processed-food.html
3•koolba•2m ago•0 comments

Burnout is breaking a sacred pact

https://usefulfictions.substack.com/p/burnout-is-breaking-a-sacred-pact
1•piinbinary•4m ago•0 comments

Man sues restaurant after promotional TikTok video exposes his secret affair

https://www.dexerto.com/tiktok/man-sues-restaurant-after-promotional-tiktok-video-exposes-his-sec...
1•randycupertino•4m ago•0 comments

Show HN: I spent 12 months building a conversational agent for social media

https://www.postreach.ai/
1•John_V•4m ago•0 comments

The $14 Burrito: Why San Francisco Inflation Feels Higher Than 2.5%

https://www.foglinesf.com/p/the-14-burrito-why-san-francisco-inflation-feels-higher-than-2-5
2•KothuRoti•5m ago•0 comments

Show HN: FailWatch – A fail-closed circuit breaker for AI agents

https://github.com/Ludwig1827/FailWatch
1•Sheeplover•6m ago•1 comments

FireAid Los Angeles 2025: Mutual Aid Maps at the Speed of Feeling [video]

https://www.youtube.com/watch?v=1wy2NCHFepU
1•johanam•6m ago•0 comments

EVs remain a niche choice in the US, according to survey

https://arstechnica.com/cars/2026/01/evs-remain-a-niche-choice-in-the-us-according-to-survey/
1•pseudolus•6m ago•0 comments

The Haskell Debugger for GHC 9.14

https://discourse.haskell.org/t/the-haskell-debugger-for-ghc-9-14/13499
1•cosmic_quanta•7m ago•1 comments

Tin, Git for agentic coding conversations

https://github.com/dadlerj/tin
1•dadlerj•8m ago•1 comments

Using Marquez as a lineage tool for Celery (2024)

https://blog.dataengineerthings.org/a-fun-experiment-using-marquez-as-a-lineage-tool-for-celery-e...
1•aeontech•9m ago•0 comments

Asimov Cascade

https://aimode.substack.com/p/asimov-cascade
1•warthog•10m ago•0 comments

Show HN: ZK-auctions – experimenting with zero-knowledge sealed-bid auctions

https://github.com/ndrwnaguib/zk-auctions
1•ndrwnaguib•11m ago•0 comments

Pundits Talking Their Book (2015)

https://www.bankers-anonymous.com/blog/pundits-talking-their-book/
1•walterbell•11m ago•0 comments

Tying the Masterpoint Knot

https://masterpoint.io/tying-the-masterpoint-knot/
1•mooreds•12m ago•0 comments

AMD's EPYC Venice, Instinct MI455X, & Helios Hardware on Display for First Time

https://www.servethehome.com/amds-epyc-venice-instinct-mi455x-helios-hardware-on-display-for-firs...
1•rbanffy•12m ago•0 comments

Show HN: One Card Tarot – AI-powered one-card tarot for quick daily insight

https://onecardtarot.org
1•xiaoshumiao•14m ago•0 comments

Aldrich Ames, CIA agent who sold secrets to the Soviets, dies aged 84

https://www.bbc.com/news/articles/cj0n29395j3o
2•mudil•14m ago•0 comments

Show HN: Luther Enterprise: Dev platform for operating end-to-end mega workflows

https://enterprise.luthersystems.com/
3•iamsamwood•15m ago•0 comments

Show HN: A unified search engine that delivers 3 Wiki articles and 1YT per query

https://pli7.vercel.app/
1•seridarivus•16m ago•0 comments

Task-free intelligence testing of LLMs

https://www.marble.onl/posts/tapping/index.html
1•amarble•16m ago•0 comments

Vector Fields in Augmented Reality

https://a-sumo.github.io/posts/visualizing-vector-fields-on-ar-glasses/
1•rslice•16m ago•0 comments

Versioned Package.swift Files

https://www.massicotte.org
3•maguszin•17m ago•0 comments

Cryogenic Etch: A Key Enabler of 3D NAND

https://semiengineering.com/cryogenic-etch-a-key-enabler-of-3d-nand/
2•rbanffy•18m ago•0 comments

Fractional Product Design Lead

https://www.yantso.com/
1•Yantso•18m ago•1 comments

Show HN: ElevenPage Reader – Chrome Extension to read your page with ElevenLabs

https://github.com/aedrax/ElevenPage-Reader
1•aedrax•18m ago•0 comments

Where 'hate speech' censorship is worse than on U.S. campuses

https://www.washingtonpost.com/opinions/2026/01/06/hate-speech-censorship-campus-european-union-b...
1•whatisabcdefgh•18m ago•0 comments

Socure achieved a 50% cost reduction by migrating to Amazon EMR Serverless

https://aws.amazon.com/blogs/big-data/how-socure-achieved-50-cost-reduction-by-migrating-from-sel...
1•mooreds•19m ago•0 comments