1 point by homarp 9 months ago | 2 comments

AutistiCoder 9 months ago
I've got ADHD, so here's the short and sweet version, as explained to me by (ironically) ChatGPT:
They essentially subjected various LLMs, including GPT and DeepSeek-R1, to the prisoner's dilemma and changed the circumstances to see whether the models would adapt or stick with an existing solution.
The LLMs adapted.
Notably, these LLMs aren't explicitly programmed to do that, which makes the adaptability all the more impressive.
homarp 9 months ago
A study of LLM strategy when playing an iterated prisoner's dilemma game (100 rounds).
Players include six predetermined strategies (Always Cooperate, Always Defect, Random, Win-Stay Lose-Switch, Tit for Tat, Grim Trigger) and a number of LLMs.
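For anyone who wants to poke at the setup, here is a minimal Python sketch of the six fixed strategies playing a 100-round match against each other. The payoff values (5/3/1/0) are the usual textbook ones and are an assumption on my part, since nothing here says what the study used. The function names and the "win" threshold for Win-Stay Lose-Switch are also my own choices, and the LLM players are not modelled at all.

```python
import random

C, D = "C", "D"

# ASSUMPTION: textbook payoffs (mine, theirs); the study's actual values are not given here.
PAYOFF = {(C, C): (3, 3), (C, D): (0, 5), (D, C): (5, 0), (D, D): (1, 1)}

# Each strategy receives its own move history and the opponent's, and returns "C" or "D".
def always_cooperate(mine, theirs):
    return C

def always_defect(mine, theirs):
    return D

def random_move(mine, theirs):
    return random.choice([C, D])

def tit_for_tat(mine, theirs):
    # Cooperate first, then copy the opponent's previous move.
    return theirs[-1] if theirs else C

def grim_trigger(mine, theirs):
    # Cooperate until the opponent defects once, then defect forever.
    return D if D in theirs else C

def win_stay_lose_switch(mine, theirs):
    # Cooperate first. Repeat the last move after a "win", flip it after a "loss".
    # ASSUMPTION: a payoff of 3 or more counts as a win.
    if not mine:
        return C
    last_payoff = PAYOFF[(mine[-1], theirs[-1])][0]
    if last_payoff >= 3:
        return mine[-1]
    return D if mine[-1] == C else C

def play(strategy_a, strategy_b, rounds=100):
    """Play an iterated match and return the total scores (a, b)."""
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(hist_a, hist_b)
        move_b = strategy_b(hist_b, hist_a)
        pay_a, pay_b = PAYOFF[(move_a, move_b)]
        score_a += pay_a
        score_b += pay_b
        hist_a.append(move_a)
        hist_b.append(move_b)
    return score_a, score_b

if __name__ == "__main__":
    print(play(tit_for_tat, always_defect))
    print(play(win_stay_lose_switch, random_move))
```

An LLM player would slot in as one more function with the same `(mine, theirs)` signature, with the histories turned into a prompt; how the study actually did that is not described in this thread.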