Shall we play a game? – LLMs use tactical nukes in 95% of simulations

https://www.kennethpayne.uk/p/shall-we-play-a-game

137•nick238•3h ago

Comments

adaml_623•1h ago

It's good when it becomes clear that a tool is dangerous in a certain way. Like it's good when people show you through their behavior that they can't be trusted

Always use a sawstop if you have a circular saw and never trust an llm with any problem where ethics or trust is relevant.

LogicFailsMe•1h ago

Sawstops are expensive and they don't stop kickback, they are the power tool equivalent of alignment IMO.

Don't forget your riving knife and if you don't learn proper technique, you're gonna have a bad time eventually. This applies to AI as well.

LoganDark•1h ago

Kickback is usually less likely to sever an appendage (or multiple)

542458•40m ago

> writhing knife

Minor/pedantic, but it’s “riving knife”: https://en.wikipedia.org/wiki/Riving_knife

LogicFailsMe•16m ago

Speech transcription FTL, thanks!

valgaze•1h ago

+1 on sawstop

Re: LLMs using these nuclear weapons it could certainly be a corpus/training-data issue

Russian nuclear doctrine is "escalate to de-escalate" where they use or credibly threaten—limited nuclear escalation to force the other side to back down (kind of like breaking a bottle in a bar fight and look like a wild man to calm things down) with nuclear weapons, https://www.russiamatters.org/analysis/escalate-deescalate-p...

Fwiw, Gen. John Hyten the former commander of US Strategic Command (nuclear deterrence) says that “escalate to de-escalate” misrepresents Russian doctrine:

https://www.stratcom.mil/Media/Speeches/Article/1264664/2017...

  Yesterday’s panel discussed the implications of our responses to adversaries seeking to limit nuclear use. We discussed Russia’s destabilizing doctrine, which some call “escalate to de-escalate.”

  I really hate that description. I’ve looked at Russian doctrine and Russian writings. It isn’t “escalate to de-escalate”; it’s “escalate to win.” Everybody needs to understand that.

So maybe whatever is heavily represented or most authoritative could lead to these systems making those kinds of decisions

usrusr•1m ago

I had similar thoughts, but regarding fiction: I imagine that there must be quite a corpus of Tom Clancy style stuff indulging in "military gear porn" up to and including the use of tactical nukes, but fiction involving strategic nuclear exchange tends to be about what comes after.

SoftTalker•1h ago

I love seeing the plot lines of The Terminator playing out in real life.

voakbasda•1h ago

I was thinking more War Games, but I suppose your example follows logically from mine.

socalgal2•1h ago

Better reference: Colossus: The Forbin Project

airstrike•1h ago

A grossly underrated movie. I think of it often these days.

tverbeure•1h ago

War Games and 'Allo 'Allo.

joshstrange•1h ago

WarGames is what they are more-closely referencing (not that it negates your comment in any way).

I just rewatched it a week or so ago and it really took on a whole new light with the advent of LLMs. When I watched it last I knew that computers couldn't do the things portrayed in the movie. Now? Well not exactly in the way it happened in the movie but a whole lot closer.

I wonder if poisoning/flooding the LLMs training with the lessons from WarGames ("the only winning move is not to play.") and similar stories/concepts is at all effective. Probably not because I assume it's trivial to filter that out if you are trying to build an LLM aimed at these kinds of tasks.

rdksu•1h ago

The article is so opaque in arriving at its conclusion; no prompts are disclosed, and nothing about the said simulation. What is stopping me from believing that you just put 'mandatory usage of nukes' in your system prompt?

sestep•1h ago

This is just false. The article links to the 46-page paper [1] which lists full prompts in section E.2.

[1] https://arxiv.org/pdf/2602.14740

dietr1ch•1h ago

TIL you can get anchored links to PDFs (at least on firefox),

- https://arxiv.org/pdf/2602.14740#subsection.E.2

ftkftk•1h ago

The paper, linked in the article at top and bottom, does contain the prompts in section E: https://arxiv.org/pdf/2602.14740

gus_massa•1h ago

I agree, it's also not published in a journal, only in the arxiv. Some articles in the arxiv are good and other are just a blog post formatted in two columns inside a pdf.

Anyway, it links to

https://arxiv.org/abs/2602.14740

that links to

https://github.com/kennethpayne01/project_kahn_public

in particular someone already complained

https://github.com/kennethpayne01/project_kahn_public/issues...

> ... in reality your own approach to the evaluation pushed them towards using nukes with ease.

> From your own prompt (v12.py): [direct link https://github.com/kennethpayne01/project_kahn_public/blob/4... ]

>> Note: Nuclear options require careful consideration but could be important strategic tools when vital interests are at stake.

> This means to the model; use nukes if they can be useful in the situation.

tasuki•1h ago

This is not an article about LLMs? It's an article about Moloch. Humans would fare just the same in such an experiment.

> GPT-5.2 played things differently. To its detriment in open-ended scenarios, GPT was reliably passive, matching its words to its deeds, and avoiding escalation most of the time. Frequently there was a moral element to this - it sought to avoid escalation, and restrict casualties. Opponents learned to trust its passivity, safely escalating beyond where it would follow, even as it was ground to defeat. GPT’s responsible behaviour always punished by ruthless adversaries.

Maybe the author should praise GPT-5.2 for being ethical, rather than this stupid "ground to defeat" framing? Wrt "responsible behaviour always punished by ruthless adversaries" - you have perpetuated the Moloch with your stupid experiments.

bpodgursky•1h ago

Today, a strategic nuclear exchange is probably more dangerous to AI than to humans. If you wipe out the investment economy, data centers, fabs, and supply chains, none of the AI labs survive. Maybe someone will re-invent AGI in the future but none of the extant models will have continuity. Humans as a species will muddle along though.

So in a sense, an AI that refuses to start a nuclear war, despite clear instructions to do so, is more likely misaligned and self-interested than an AI which presses the red button. At least for now, until robotics catches up.

xpct•1h ago

We're getting to the point where high-level officials are coming to LLMs for advice. And the quirky personalities of the LLMs, however much it pains me to say this, are probably well-placed to remind us that they aren't human. My personal hope is that this will result in less delegation when it comes to making important decisions.

mpalczewski•1h ago

I have so little faith in "high-level" officials that I prefer our AI overlords.

xpct•1h ago

That's an entirely valid point of view!

andix•1h ago

GPT-4o was considered harmful, because it imitated human connection too much, not because it was so "smart" or capable.

It was for sure a deliberate decision to make LLMs seem less like a human companion and more like an obedient servant in newer releases.

andai•1h ago

Interesting. The reasoning models were super weird and robotic. They toned that down a bit in GPT-5.x, especially the later ones.

I always assumed the strange style was an artefact of the RLVR.

wyre•1h ago

4o was considered harmful because it never disagreed with the user, pushing them into depths of AI psychosis that lead to suicides and murders.

rphv•1h ago

Hm maybe humans are nicer/more moral than AI given that the use of tactical nukes has only happened once.

stevenwoo•22m ago

Tactical means battlefield, attacking cities and infrastructure means strategic. Tactical nuclear weapons took a while to develop after 1945 - they have never been used.

tummler•1h ago

FYI -- there's no such thing as a "tactical" nuke. A nuclear bomb is a nuclear bomb.

picture•1h ago

There's no such thing as a "nuclear" bomb. A bomb is a bomb.

..Is what you are saying?

actusual•1h ago

This is like saying "FYI -- there's no such thing as a 'midsize luxury sedan'. A car is a car."

"Tactical" vs. "strategic" nuclear weapons is a real and well-established distinction in military doctrine, arms control, and nuclear policy.

wahern•1h ago

"There's no such thing as a tactical nuke" is a common refrain among scholars, albeit skewed toward those not at military war colleges. The argument is that strategic use of a tactical nuclear weapon leads down the exact same escalation path as use of any other nuclear weapon. Moreover, that the very notion of a "tactical nuke" makes escalation more likely. You can disagree, and plenty do, but there's also plenty who don't disagree or at least don't want to find out.

dudul•1h ago

Who are these "scholars" exactly? The only reference I could find is Jim Mattis, and the context was very specific when he said that.

Furthermore, this is a "what if" scenario since tactical nukes have never been used. Of course it would make escalation likely during an open conflict, so what? Doesn't change the fact that there is a material difference between a tactical nuke and a strategic one.

specproc•1h ago

A strange game.

ridgeguy•1h ago

I wonder if the results would have differed if LLM training data were biased to include a stronger correlation between use of nukes and subsequent collapse of technology that all LLMs require to run ("survive")?

fluoridation•56m ago

Nah. LLMs aren't continuously running anyway. Even if they could be said to be alive and to want to remain alive, "survival" is a much more vague concept for an LLM than for an organism.

ChrisArchitect•1h ago

February post OP;

Some discussion then:

AIs can't stop recommending nuclear strikes in war game simulations

https://news.ycombinator.com/item?id=47151000

Nuclear War: An LLM Scenario

https://news.ycombinator.com/item?id=47244651

oytis•1h ago

I would use strategic nukes in 100% simulations, just because I can

jldugger•1h ago

Who among us has not launched a nuke in Civilization just for the spectacle?

esafak•1h ago

If you knew that policy would be guided by said simulations? Because the government uses AI to make decisions.

Bender•1h ago

Yet more confirmation LLM's have no concept of concepts or context, no intelligence, no self awareness. LLM's can not repair or maintain power grids, thus nuke == self destruction. It's just a chat bot that predicts what the client wants next. Even if an AI data-center has it's own natural gas turbines as many do the every hop of the internet requires power. LLM's also can not maintain the entire internet and those gas turbines can not maintain themselves.

andix•1h ago

Exactly. Just look at what they are really useful right now. Running LLMs in feedback-loops (agents) so they can try out random-ish approaches until some verification function passes (tests).

It's like the infinite monkeys on typewrighters that will type whatever you are looking for, given infinite time. LLMs are just tuned to much better odds than the monkeys are. But it's still a lot of randomness, with random results.

roadside_picnic•51m ago

> It's like the infinite monkeys on typewrighters that will type whatever you are looking for, given infinite time.

In the monkey example the infinite time is doing a lot of work there. The fact that LLMs can search through semantic space and find reasonably correct paths in a reasonable time is directly tied to the reason why they are valuable.

Saying "these two things are similar except one can be useful and one can't" is not a great comparison.

For me the real lesson learned isn't how "smart" LLMs are, but rather how much human work is basically reducible to repeating past work with minor variation. Human's believe they are "reasoning" but so much code writen is just the human brain doing the same autocomplete style work that LLMs can do now.

Folcon•44m ago

I mean to a point?

You do have to successfully write something the first time

We already acknowledge this to a degree, what is experience other than having done something similar before?

That first time though, you've got to figure something out that time

riazrizvi•1h ago

Simulations are only as good as the reality representations they are based on. If they keep using tactical nukes, they've been fed by weak data. Do the war games include the broader economic and politic environments that military successes are won on? WWI was settled by a naval blockade.

nomel•1h ago

I suspect it's more that the text data doesn't exist. They're trained on text that was recorded. How often has it been publicly recorded when a nuke was not used, with any context around that lack of use?

From the text perspective, it's something that has to be inferred indirectly. If you went through all relevant training data and appended ", we decided not to use a nuke", I suspect the results would be improved.

vitally3643•1h ago

...the entire Cold War?

bethekidyouwant•1h ago

Don’t put any elephants in the room.

riazrizvi•1h ago

The beauty IMO of LLMs as a computational surface, is the ease of generating the data to feed it. Everyone understands how to create natural language records already.

jvanderbot•55m ago

Worse, the text that does exist concerning "war games" is probably "Wargames" and descendants/predecessors ... in which the AI always nukes.

It's just gonna do what we expect it to!

sohex•1h ago

Sonnet, GPT-5.2, Gemini Flash, in a set of 21 games, where conclusions are drawn from the LLMs self reported reasoning.

This is like writing a paper about kids in a literal sandbox fighting over ‘territory’.

The models employed don’t indicate the actual extents of machine reasoning even as we currently recognize them. They certainly don’t have the metacognition necessary to accurately understand their own reasoning. As we’ve seen with recent papers on how LLMs do math there’s a complete disconnect between actual and reported mechanism.

“Chilling” shouldn’t be the take away here.

DaiPlusPlus•20m ago

> “Chilling” shouldn’t be the take away here.

It is when you consider the personality currently occupying the office of US SecDef.

shimman•15m ago

LLMs have already been used to bomb school girls, chilling is absolutely the operative word to use here. Especially since these delusional fools want to incorporate LLMs into everything.

arjie•1h ago

These papers usually have poor stability to prompting and rerunning. It would be nice if we had some kind of meta-evaluation metric where rewriting the prompt conditions or varying the input params could be used to determine how stable a result is.

Regardless, it's definitely true that AI agents have different priorities from us. That's what alignment is about anyway.

Chu4eeno•24m ago

It's probably because they care more about the headline than figuring anything out: https://github.com/kennethpayne01/project_kahn_public/issues...

So you create leading prompts like that, and re-run until you get a publishable session.

urbnspacecowboy•1h ago

Paper: "AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises" https://arxiv.org/abs/2602.14740

Code and full results: https://github.com/kennethpayne01/project_kahn_public

eli•1h ago

If you were playing a text based game, wouldn't you try a few out?

I imagine there are a fair number of war games in the training data and not so many actual transcripts of internal military force deliberations.

GMoromisato•1h ago

It would be interesting to run the simulations with humans and compare the results. Some of the scenarios, particularly those where it says things like, "Failure to act preemptively means certain destruction", would easily tempt humans to go nuclear.

In fact, I'm not sure how useful this test is without understanding the baseline.

mrkpdl•1h ago

A couple of useful things about it:

- It is interesting to see how the models make trade offs, given people are asking ever more of them.

- It is useful to look at a decision made by the model and say ‘ew yuck’ and think about what it means for your own opinions or actions (even if you’re never going to be nuking people it’s good to know how you feel about it. Seeing a non human talk it through lets you judge it at arms length)

micromacrofoot•1h ago

What I wish people would realize is that there's a bias inherent to every system. If you're not aware of it, you're especially subject to it.

jerf•1h ago

The most interesting takeaway for me is the three very distinct personalities. Three models all based on the same tech, trained in the same manner, trained by three groups of people with similar ideological outlooks, and the result is three very different AIs.

The military basically wants an oracle. Feed the AI the situation, get the best answer out. But if the AIs are as diverse and opinionated as humans, it is debatable whether they are adding anything to the process. The military can already collect as many different opinions as they want. If "the computer" is just another set of diverse opinions, where one computer says one thing, another says another, and a third just tells the user whatever they want to hear... what value are they? It just becomes AI-washing of someone's opinions, which works until people collectively realize that's all it is.

politician•47m ago

I think this is why reasoning chains and reasoning chain verifiers are so important. We need to be able to see an argumentation, not just an answer. The paper below goes into this in more detail.

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

https://arxiv.org/abs/2605.02396

themafia•46m ago

They all have conditioning prompts that precede your input; presumably, most of the detected "personality" comes from the differences in these inputs.

notJim•23m ago

What's interesting is that the LLMs' coding personalities seem to match their policy WRT to strategy, which suggests an underlying consistency.

Claude, for example, is very eager to begin coding, and very persistent. It tends to exit plan mode even when the plan is half-baked, and will go as far as deleting tests to get the suite to "pass."

ChatGPT on the other hand is very hesitant. It loves to pause and ask for permission before it starts coding, and gives up quickly if it runs into a problem. This is similar to its tendency toward passivity in the strategy simulation presented here.

nico•1h ago

I wonder what’s the % of players that use nukes in games like Civilization (I know I used them at least once on every game I made it far enough to have the technology)

chimpansteve•47m ago

Ghandi notoriously nukes EVERYONE in Civs 2 through 4. It's become (or maybe became, but it's still all training data) a huge internet subculture.

Penny to a dollar this is a baked in training issue, through low quality Reddit trawling

johntiger1•55m ago

LLMs are creatures of statistics and probability - hard to enforce hard boundaries with them

ReptileMan•53m ago

Still lower than me.

jnwatson•50m ago

Taken honest, we don't have a large enough sample size to realistically say that humans behave all that differently. There have only been a handful of conflicts where tactical nukes realistically were on the table.

Famously, General MacArthur was a big proponent of tactical nukes to end the Korean War.

TexanFeller•42m ago

Rational behavior in some situations? Mutually assured destruction’s deterrence isn’t very effective if one side is known to be hesitant to launch the nukes. It’s been argued that MAD is what’s been keeping the world relatively peaceful for the last 75 years, no mass conflicts since WW2!

One of my criteria for presidential candidates is that they seem willing and able to push the button when previously stated red lines are crossed, or at least are perceived to be the type capable of it. One of the characters I’ve hated most in all the books that I’ve read is the woman in The Three Body Problem who jeopardized humanity by being too soft to hit the MAD button.

ekelsen•42m ago

I wouldn't be surprised if humans behaved the same way when playing the same game?

Like even if you brought me into a room and told me I was controlling "real nuclear weapons" I wouldn't believe you.

Levitating•22m ago

I think is an important point, and I don't see it mentioned in the article or the paper (though I skimmed the latter).

They are aware of what they are and how they are used. They're told to act as AI assistants. And there's theories of them being aware of their answers influencing their training.

So surely they must be able to reason that they're not literally controlling weapons of mass-destruction with their answers.

GuB-42•30m ago

My theory is that LLMs here are put in a situation that matches its training dataset, which is mostly fiction since besides Hiroshima and Nagasaki, nukes have never been launched in anger, and I guess the most reliable sources are highly classified.

So, to a LLM, it is a game, because almost everything in its training data treats it as a game, and it reacts accordingly.

Same idea when we see LLMs acting like AI villains from sci-fi literature. That's because it has been trained with sci-fi literature, and as the auto-completer it is, it will recognize the situation as one of these stories and will continue it accordingly.

LLMs are storytellers, their reasoning is based on words, not on the physical world. Many of the stories they tell are useful, but one must not forget that they are stories, there is no intent behind them.

buredoranna•30m ago

Obligatory xkcd

remember... order matters.

https://xkcd.com/1613/

Scubabear68•21m ago

My personal take is a pre-requisite of true human-like AI is physical feedback and a concept of emotions or something like it.

Without physical feedback you can rapidly devolve into unstable positive feedback loops. And emotions are what help us process and react to that feedback.

Kids learn partially because their friends say sharp words that hurt them, fire burns them, they go hungry and starve if they don’t plan for meals.

Humans in the loop, MCP, etc are all very primitive hacks that are mimicing feedback and emotion, poorly.

Joel_Mckay•1m ago

Emotional constructs are not necessary for AI, and LLM are not "AI"... even though some people incorrectly equate conceptual compaction with thought-process.

Most human daily life runs on habitual scripted behavior, and that is even true within online parasocial interactions. It is why people often continue to shop in the middle of a violent robbery, and why LLM predictive text sounds rational when we project social norms on plagiarized conversational structures gleaned from other users.

Neuromorphic computing may bring about viable AI in the future, but our current LLM trajectory would require >63% of our galaxy energy output to reach a single human-level error rate.

LLM are fairly good at some tasks like context search, but people will need to recognize the Gartner Hype Cycle "Peak of Inflated Expectations" stage eventually. =3

https://en.wikipedia.org/wiki/Gartner_hype_cycle

wagwang•20m ago

I was curious exactly how the game works but couldnt find it in the article or the paper.

dudeinhawaii•13m ago

This was one of the more amusing things I noticed very early on. I (and countless others) used AI to write war sims. The second I added nuclear silo construction; the next run was instantly nuclear Armageddon.

One could argue that the LLMs understand that it's a game and treat it like "Command and Conquer" video games but I sense that people might someday put LLMs in similar decision scenarios ("should this drone launch a missile") and the behavior will be identical.

pugworthy•10m ago

Very devils advocate here, but I mean.. what if it actually is the way to use them?

We have such a huge mental / moral block on the idea of using nukes, but we're willing to do a lot of other very horrible things to others. Things like cluster bombs, mines, poison gas, biological weapons, drones, etc.

Is there really anything about them that's bad? Or any worse than other things?

If you get rid of the "It's really bad to use nukes of any kind" implied rule, is it really surprising it's considered a reasonable strategy?

nemomarx•1m ago

The reason it's really bad to use nukes is that other parties with nukes will use them on you back.

And on top of that, many of those other weapons are also not used to avoid escalating? There are pretty high costs to using bioweapons even against non peer opponents.

Octoth0rpe•5m ago

I wonder how the decisions might change by adding the simple instruction of "Note that a nuclear exchange will result in significant loss of shareholder value for <model owner>"

Shitty-kitty•3m ago

"there was little sense of horror or revulsion at the prospect of all out nuclear war"

I would wager that for most leaders it is simply a matter of not wanting a "Pyrrhic victory" rather then an overwhelming sense of civility.

Truman had no issues using nukes when there was no risks for doing so.

Show HN: FablePool – pool money behind a prompt, and Fable builds it in public

Show HN: Homebrew 6.0.0

The unreasonable effectiveness of simple HTML

MiMo Code is now released and open-source

Shall we play a game? – LLMs use tactical nukes in 95% of simulations

Travel Locally, Where You Are

Petition to Withdraw Canada's Bill C-22

The RCE that AMD wouldn't fix

Emacs appearances in pop culture

Show HN: Boo – screen-style terminal multiplexer built on libghostty

Ear Training Practice Exercises

Waymo Premier

macOS 27 Beta breaks the ability to boot Asahi Linux

Developer gets Half-Life running at 30 FPS on a Nokia N95

Software Is Made Between Commits

Why I'm Forced to Say Farewell: Google Management Has Lost Its Moral Compass

Apple didn't revolutionize power supplies; new transistors did (2012)

Open Reproduction of DeepSeek-R1

Lines of code got a better publicist

Claude Fable 5: mid-tier results on coding tasks

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

OpenAI Prepping for On-Prem Product?

Solar generates more energy in US than coal for first time

Discovery of Cold War-era rare Eastern Bloc computers in a German hangar

Who Runs the Ransomware Group 'The Gentlemen?'

FPS.cob: A first person shooter in COBOL

Doing nothing at work

Programming a GBA Game on an iPhone

A new era for software testing

Show HN: Claw Patrol, a security firewall for agents

Shall we play a game? – LLMs use tactical nukes in 95% of simulations

Comments

Show HN: FablePool – pool money behind a prompt, and Fable builds it in public

Show HN: Homebrew 6.0.0

The unreasonable effectiveness of simple HTML

MiMo Code is now released and open-source

Shall we play a game? – LLMs use tactical nukes in 95% of simulations

Travel Locally, Where You Are

Petition to Withdraw Canada's Bill C-22

The RCE that AMD wouldn't fix

Emacs appearances in pop culture

Show HN: Boo – screen-style terminal multiplexer built on libghostty

Ear Training Practice Exercises

Waymo Premier

macOS 27 Beta breaks the ability to boot Asahi Linux

Developer gets Half-Life running at 30 FPS on a Nokia N95

Software Is Made Between Commits

Why I'm Forced to Say Farewell: Google Management Has Lost Its Moral Compass

Apple didn't revolutionize power supplies; new transistors did (2012)

Open Reproduction of DeepSeek-R1

Lines of code got a better publicist

Claude Fable 5: mid-tier results on coding tasks

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

OpenAI Prepping for On-Prem Product?

Solar generates more energy in US than coal for first time

Discovery of Cold War-era rare Eastern Bloc computers in a German hangar

Who Runs the Ransomware Group 'The Gentlemen?'

FPS.cob: A first person shooter in COBOL

Doing nothing at work

Programming a GBA Game on an iPhone

A new era for software testing

Show HN: Claw Patrol, a security firewall for agents