Nuclear War: An LLM Scenario

https://chrisclapham.com/blog/nuclear-war-an-llm-scenario
21•huey77•7h ago

Comments

roxolotl•2h ago
We don’t need AGI or superintelligence for these things to be dangerous. We just need to be willing to hand over our decision-making to a machine.

And of course a human can make a wrong call too. In this scenario that’s what is happening. And of course we should bring all of our tools to bear when it comes to evaluating nuclear threats.

But that doesn’t make it less concerning that we’ve now got machines capable of linguistic persuasion in that toolset.

asah•1h ago
"hand over" is a misnomer - what actually happens is that there's an interaction with a machine and people either trust it too much, or forget that it's a machine (i.e. handed from one person to another and the "AI warning" label is accidentally or intentionally ripped off)
chuckadams•1h ago
Would you like to play a game?
laughingcurve•1h ago
The quote is "Shall we play a game?"

"Would you like to play a game?" is from Saw.

user2722•1h ago
No one got fired for ~buying IBM~ following a statistics-based text output.
shmeeed•25m ago
In this scenario, ICBMs got fired.
user2722•1h ago
I'd posit that the faster we feed LLMs existing nuclear crises and invented nuclear scenarios dissimilar to their training corpus, the better we will know how wrong they can be. Fear-mongering isn't lucrative, isn't dopamine triggering, isn't actionable, and doesn't look good on a resume, so it's typically ignored.
itintheory•51m ago
> Fear-mongering isn't lucrative, isn't dopamine triggering

Isn't it? Isn't fear-mongering one of the main selling points for news-media? And a driving factor of engagement in social media?

motbus3•1h ago
This is not unlikely. This is actually likely. The instructions for those agents are to find signals that prove there is an attack. LLMs are steered to do what they are asked. They will interpret the signals as strongly as possible. They will omit counter-evidence to achieve their objective. They will distort analysis to reach their objective.

This has been everyone's daily LLM problem. How is that not clear yet?

chuckadams•1h ago
I don't disagree, but just to play devil's advocate: the LLM can also be told to look for counter-evidence, and will at least make a stab at doing so. That's more than we can expect from the humans currently in charge.
twoodfin•1h ago
Since the beginning of the nuclear age, literally billions of dollars have been spent paying incredibly smart people to model all aspects of nuclear war, including the chain of escalation under uncertainty.

Not to discount the importance of this risk, but we’re not likely to sleepwalk into it, barring a collapse in strategic & operational competence in planning (yeah, yeah) that would make MANY risks dangerously severe.

rglover•1h ago
The big problem here is determining how vigilant those in command are about vetting the AI's responses. This feels like one of those systems that works great until someone vaporizes a hallucinated target that was actually civilians or unintended targets. This should be mitigated by having a MITM, but still. Risky. Humans make mistakes, too, and they're inclined to just "believe what the computer says," so as much as I'd love to believe this ends with a white picket fence scene, my instincts are screaming "dig a bunker, homie."
laughingcurve•1h ago
https://arxiv.org/abs/2509.17192

Shall We Play a Game? Language Models for Open-ended Wargames

Wargames are simulations of conflicts in which participants' decisions influence future events. While casual wargaming can be used for entertainment or socialization, serious wargaming is used by experts to explore strategic implications of decision-making and experiential learning. In this paper, we take the position that Artificial Intelligence (AI) systems, such as Language Models (LMs), are rapidly approaching human-expert capability for strategic planning -- and will one day surpass it. Military organizations have begun using LMs to provide insights into the consequences of real-world decisions during _open-ended wargames_ which use natural language to convey actions and outcomes. We argue the ability for AI systems to influence large-scale decisions motivates additional research into the safety, interpretability, and explainability of AI in open-ended wargames. To demonstrate, we conduct a scoping literature review with a curated selection of 100 unclassified studies on AI in wargames, and construct a novel ontology of open-endedness using the creativity afforded to players, adjudicators, and the novelty provided to observers. Drawing from this body of work, we distill a set of practical recommendations and critical safety considerations for deploying AI in open-ended wargames across common domains. We conclude by presenting the community with a set of high-impact open research challenges for future work.

Nobody Gets Promoted for Simplicity

https://terriblesoftware.org/2026/03/03/nobody-gets-promoted-for-simplicity/
325•aamederen•3h ago•184 comments

Glaze by Raycast

https://www.glazeapp.com/
89•romac•2h ago•48 comments

"It Turns Out"

https://jsomers.net/blog/it-turns-out
23•Munksgaard•38m ago•6 comments

Motorola GrapheneOS devices will be bootloader unlockable/relockable

https://grapheneos.social/@GrapheneOS/116160393783585567
1009•pabs3•14h ago•409 comments

Qwen3.5 Fine-Tuning Guide – Unsloth Documentation

https://unsloth.ai/docs/models/qwen3.5/fine-tune
67•bilsbie•3h ago•14 comments

Apple Introduces MacBook Neo

https://www.apple.com/newsroom/2026/03/say-hello-to-macbook-neo/
281•dm•1h ago•271 comments

Chimpanzees Are into Crystals

https://www.nytimes.com/2026/03/04/science/chimpanzees-crystals.html
41•jimnotgym•7h ago•19 comments

The one science reform we can all agree on, but we're too cowardly to do

https://www.experimental-history.com/p/the-one-science-reform-we-can-all
14•sito42•37m ago•2 comments

Libre Solar – Open Hardware for Renewable Energy

https://libre.solar
31•evolve2k•3d ago•8 comments

RFC 9849. TLS Encrypted Client Hello

https://www.rfc-editor.org/rfc/rfc9849.html
180•P_qRs•8h ago•79 comments

RE#: how we built the fastest regex engine in F#

https://iev.ee/blog/resharp-how-we-built-the-fastest-regex-in-fsharp/
116•exceptione•3d ago•44 comments

Greg Knauss Is Losing Himself

https://shapeof.com/archives/2026/2/greg_knauss_is_losing_himself.html
28•wallflower•2d ago•3 comments

Jiga (YC W21) Is Hiring

https://jiga.io/about-us
1•grmmph•3h ago

Charging a three-cell nickel-based battery pack with a Li-Ion charger [pdf]

https://www.ti.com/lit/an/slyt468/slyt468.pdf
12•theblazehen•1d ago•0 comments

Agentic Engineering Patterns

https://simonwillison.net/guides/agentic-engineering-patterns/
321•r4um•10h ago•173 comments

Elevator Saga: The elevator programming game (2015)

https://play.elevatorsaga.com/index.html
55•xmprt•3d ago•8 comments

A CPU that runs entirely on GPU

https://github.com/robertcprice/nCPU
168•cypres•11h ago•85 comments

Bet on German Train Delays

https://bahn.bet
216•indiantinker•5h ago•146 comments

Better JIT for Postgres

https://github.com/vladich/pg_jitter
111•vladich•9h ago•42 comments

Show HN: Stacked Game of Life

https://stacked-game-of-life.koenvangilst.nl/
101•vnglst•3d ago•21 comments

Modern Illustration: Archive of illustration from c.1950-1975

https://www.modernillustration.org
31•eustoria•3d ago•4 comments

Apple Announces Low-Cost 'MacBook Neo' with A18 Pro Chip

https://www.macrumors.com/2026/03/04/apple-announces-low-cost-macbook-neo-with-a18-pro-chip/
46•vanburen•1h ago•12 comments

A Visual Guide to DNA Sequencing

https://www.asimov.press/p/dna-sequencing
5•surprisetalk•1h ago•0 comments

Claude's Cycles [pdf]

https://www-cs-faculty.stanford.edu/~knuth/papers/claude-cycles.pdf
709•fs123•1d ago•298 comments

Graphics Programming Resources

https://develop--gpvm-website.netlify.app/resources/
151•abetusk•13h ago•13 comments

Did Alibaba just kneecap its powerful Qwen AI team?

https://venturebeat.com/technology/did-alibaba-just-kneecap-its-powerful-qwen-ai-team-key-figures...
61•GTP•2h ago•20 comments

Show HN: I made a zero-copy coroutine tracer to find my scheduler's lost wakeups

https://github.com/lixiasky-back/coroTracer
38•lixiasky•1d ago•1 comments

Medical journal says the case reports it has published for 25 years are fiction

https://retractionwatch.com/2026/03/03/canadian-pediatric-society-journal-correction-case-reports...
10•Tomte•28m ago•0 comments

Weave – A language aware merge algorithm based on entities

https://github.com/Ataraxy-Labs/weave
161•rs545837•13h ago•91 comments

Voxile: A ray-traced game made in its own engine and programming language

https://elbowgreasegames.substack.com/p/voxray-games-pushes-major-update
246•spacemarine1•18h ago•65 comments