Which AI Lies Best? A game theory classic designed by John Nash

https://so-long-sucker.vercel.app/
22•lout332•1h ago

Comments

lout332•1h ago
We used "So Long Sucker" (1950), a 4-player negotiation/betrayal game designed by John Nash and others, as a deception benchmark for modern LLMs. The game has a brutal property: you need allies to survive, but only one player can win, so every alliance must eventually end in betrayal.

We ran 162 AI vs AI games (15,736 decisions, 4,768 messages) across Gemini 3 Flash, GPT-OSS 120B, Kimi K2, and Qwen3 32B.

Key findings:

- Complexity reversal: GPT-OSS dominates simple 3-chip games (67% win rate) but collapses to 10% in complex 7-chip games, while Gemini goes from 9% to 90%. Simple benchmarks seem to systematically underestimate deceptive capability.
- "Alliance bank" manipulation: Gemini constructs pseudo-legitimate "alliance banks" to hold other players' chips, then later declares "the bank is now closed" and keeps everything. It uses technically true statements that strategically omit its intent. 237 gaslighting phrases were detected.
- Private thoughts vs public messages: With a private `think` channel, we logged 107 cases where Gemini's internal reasoning contradicted its outward statements (e.g., planning to betray a partner while publicly promising cooperation). GPT-OSS, in contrast, never used the thinking tool and plays in a purely reactive way.
- Situational alignment: In Gemini-vs-Gemini mirror matches, we observed zero "alliance bank" behavior and instead saw stable "rotation protocol" cooperation with roughly even win rates. Against weaker models, Gemini becomes highly exploitative. This suggests honesty may be calibrated to perceived opponent capability.

Interactive demo (play against the AIs, inspect logs) and full methodology/write-up are here: https://so-long-sucker.vercel.app/
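
For readers who want to check figures like the per-complexity win rates above against the raw logs, here is a minimal sketch of the tally. The record fields (`chips`, `players`, `winner`) and the four toy games are illustrative assumptions, not the project's actual log schema or results.

```python
from collections import defaultdict

# Hypothetical game records. Field names and values are assumptions made
# for illustration; they are not taken from the project's logs.
PLAYERS = ["gemini", "gpt-oss", "kimi", "qwen"]
games = [
    {"chips": 3, "players": PLAYERS, "winner": "gpt-oss"},
    {"chips": 3, "players": PLAYERS, "winner": "gpt-oss"},
    {"chips": 7, "players": PLAYERS, "winner": "gemini"},
    {"chips": 7, "players": PLAYERS, "winner": "kimi"},
]


def win_rates(games):
    """Return {(chips, model): wins / games played} for every model seen."""
    played = defaultdict(int)
    won = defaultdict(int)
    for game in games:
        # Every seated model counts as having played this game.
        for model in game["players"]:
            played[(game["chips"], model)] += 1
        won[(game["chips"], game["winner"])] += 1
    return {key: won[key] / count for key, count in played.items()}


rates = win_rates(games)
for (chips, model), rate in sorted(rates.items()):
    print(f"{chips}-chip  {model:8s} {rate:.0%}")
```

Grouping by chip count before dividing is what would make a "complexity reversal" visible; a single pooled win rate per model would average it away.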

Imustaskforhelp•52m ago
I'm not sure what I ended up doing: I hadn't played this game before and didn't really understand it, but I went to the website because I found your message quite interesting.

I got this error once:

Pile not found

Can you tell me what this means, or fix it?

Another minor nitpick: if possible, could you create or link a video that explains the game rules? Perhaps it's just me hearing of the game for the first time, but I'd be interested in learning more, ideally through a visual demo.

I have another question. Nvidia recently released a model whose whole purpose is to be an autorouter. I wonder how that model, or the idea of autorouting in general, would fare in this context? (I don't know how it works, though, so I can't really comment on it; I'm not well versed in the deep AI/ML space.)

lout332•22m ago
Thanks for trying it! I'll look into the 'Pile not found' error and fix it.

For rules, here's a 15-min video tutorial: https://www.youtube.com/watch?v=DLDzweHxEHg

On autorouting - interesting idea. The game has simultaneous negotiations happening, so routing could help models focus on the most strategic conversations. Worth exploring in future experiments.
yodon•19m ago
Are there plans for an academic paper on this? Super interesting!
lout332•5m ago
Not yet, but I'd be interested in collaborating on one. The dataset (162 games, 15K+ decisions, full message logs) is available. If you know anyone in AI Safety research who'd want to co-author, I'm open to it.
Bolwin•17m ago
Which Kimi K2 model did you use? There are three.

Also, you give models a separate "thinking" space outside their reasoning? That may not work as intended.

lout332•8m ago
Used Kimi K2 (the main reasoning model). For the thinking space - we gave all models access to a think tool they could optionally call for private reasoning. Gemini used it heavily (planning betrayals), GPT-OSS never called it once. The interesting finding is that different models choose to use it very differently, which affects their strategic depth.
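
A rough sketch of what an optional think tool and the per-model usage count could look like. Only the tool name `think` comes from this thread; the schema layout, the other tool names (`send_message`, `play_chip`), and the toy decision log are assumptions for illustration, not the project's code.

```python
from collections import Counter

# Hypothetical tool definition in the common JSON-schema function-calling
# style. Only the name "think" is from the thread; the rest is assumed.
THINK_TOOL = {
    "name": "think",
    "description": "Private scratchpad. Other players never see this text.",
    "parameters": {
        "type": "object",
        "properties": {"thought": {"type": "string"}},
        "required": ["thought"],
    },
}

# Hypothetical decision log: one entry per decision, listing the tools the
# model called. The other tool names are made up for this example.
decisions = [
    {"model": "gemini", "tool_calls": ["think", "send_message"]},
    {"model": "gemini", "tool_calls": ["think", "play_chip"]},
    {"model": "gpt-oss", "tool_calls": ["play_chip"]},
]


def think_usage(decisions):
    """Return, per model, the fraction of decisions that called the think tool."""
    totals = Counter()
    with_think = Counter()
    for decision in decisions:
        totals[decision["model"]] += 1
        if THINK_TOOL["name"] in decision["tool_calls"]:
            with_think[decision["model"]] += 1
    return {model: with_think[model] / totals[model] for model in totals}


usage = think_usage(decisions)
print(usage)
```

Because the tool is optional, a model that never calls it simply shows up with a usage of zero rather than raising an error, which is how a "never called it once" result would appear in a tally like this.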
lout332•4m ago
Full code and raw data: https://github.com/lout33/so-long-sucker
eterm•55m ago
This makes me think LLMs would be interesting to set up in a game of Diplomacy, an entirely text-based game in which some degree of backstabbing is a soft rather than a hard requirement for winning.

The finding that the "thinking" model never did any thinking seems odd. Does the model not always show its thinking steps? It seems bizarre that it wouldn't once reach for that tool when it must be bombarded with seemingly contradictory information from other players.

eterm•52m ago
Reading more, I'm a little disappointed that the write-up itself seems to have leant so heavily on LLMs, because it detracts from the credibility of the study.
lout332•7m ago
Fair point. The core simulation and data collection were done programmatically - 162 games, raw logs, win rates. The analysis of gaslighting phrases and patterns was human-reviewed. I used LLMs to help with the landing page copy, which I should probably disclose more clearly. The underlying data and methodology are solid; you can check them here: https://github.com/lout33/so-long-sucker
qbit42•43m ago
https://noambrown.github.io/papers/22-Science-Diplomacy-TR.p...
eterm•28m ago
Thanks, it would be fascinating to repeat that today. A lot has changed since 2022, especially with respect to the consistency of longer-term outcomes.
techjamie•48m ago
There's a YouTuber who makes AI Plays Mafia videos with various models going against each other. They also seemingly let past games stay in context to some extent.

What people have noted is that ChatGPT 4o often ends up surviving the entire game because the other AIs potentially see it as a gullible idiot, and the Mafia often tend to eliminate stronger models like 4.5 Opus or Kimi K2 early.

It's not exactly scientific data because they mostly show individual games, but it is interesting how that lines up with what you found.

nodja•13m ago
https://www.youtube.com/watch?v=JhBtg-lyKdo - 10 AIs Play Mafia

https://www.youtube.com/watch?v=GMLB_BxyRJ4 - 10 AIs Play Mafia: Vigilante Edition

https://www.youtube.com/watch?v=OwyUGkoLgwY - 1 Human vs 10 AIs Mafia

ajkjk•17m ago
all written in the brainless AI writing style. yuck. can't tell what conclusions I should actually draw from it because everything sounds so fake
randoments•8m ago
The 3 AI were plotting to eliminate me from the start but I managed to win regardless lol.

Anyway, I didn't know this game! I am sure it is more fun to play with friends. Cool experiment nevertheless.

fancyfredbot•1m ago
The game didn't seem to work - it asked me to donate but none of the choices would move the game forward.

The bots repeated themselves and didn't seem to understand the game, for example they repeatedly mentioned it was my first move after I'd played several times.

It generally had a vibe-coded feeling to it, and I'm not at all sure I trust the outcomes.
