They essentially subjected various LLMs, including GPT and DeepSeek-R1, to the prisoner's dilemma and changed the circumstances to see whether the models would adapt or stick with an existing solution.
The LLMs adapted.
Notably, these LLMs aren't explicitly programmed to do that, which makes the adaptability all the more impressive.
homarp•4h ago
Players include 6 predetermined strategies (Always Cooperate, Always Defect, Random, Win-Stay Lose-Switch, Tit for Tat, Grim Trigger) and a number of LLMs.
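
For anyone unfamiliar with the six fixed strategies, here is a rough Python sketch of how they are usually defined in the iterated prisoner's dilemma. This is not the setup from the linked work: the payoff values (5/3/1/0) are the conventional textbook ones and are an assumption, and the function names and the `play` helper are made up for illustration.

```python
import random

C, D = "C", "D"

# Assumed textbook payoffs (not taken from the thread): (row player, column player)
PAYOFFS = {(C, C): (3, 3), (C, D): (0, 5), (D, C): (5, 0), (D, D): (1, 1)}


def always_cooperate(mine, theirs):
    return C


def always_defect(mine, theirs):
    return D


def random_strategy(mine, theirs):
    return random.choice([C, D])


def win_stay_lose_switch(mine, theirs):
    # Open with cooperation. A "win" here means the opponent cooperated last
    # round (the two higher payoffs): keep the same move, otherwise switch.
    if not mine:
        return C
    if theirs[-1] == C:
        return mine[-1]
    return D if mine[-1] == C else C


def tit_for_tat(mine, theirs):
    # Cooperate first, then copy the opponent's previous move.
    return theirs[-1] if theirs else C


def grim_trigger(mine, theirs):
    # Cooperate until the opponent defects once, then defect forever.
    return D if D in theirs else C


def play(a, b, rounds=10):
    """Hypothetical helper: run a match, return both scores and move histories."""
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a, move_b = a(hist_a, hist_b), b(hist_b, hist_a)
        pay_a, pay_b = PAYOFFS[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        hist_a.append(move_a)
        hist_b.append(move_b)
    return score_a, score_b, hist_a, hist_b
```

Each strategy only sees the two move histories, so something like `play(tit_for_tat, always_defect, rounds=5)` shows Tit for Tat losing the first round and then defecting for the rest of the match. An LLM player would presumably slot in as another function with the same signature, but how the actual experiment wires that up is not described here.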