Discovering Differences in Strategic Behavior Between Humans and LLMs
Caroline Wang, Daniel Kasenberg, Kim Stachenfeld, Pablo Samuel Castro
TL;DR
This work investigates how frontier LLMs differ from humans in strategic play within Iterated Rock-Paper-Scissors (IRPS) by automatically discovering interpretable behavioral models with AlphaEvolve. It combines large-scale human and matched-LLM datasets against a suite of bots and compares against baseline symbolic and neural models. The main contributions are (1) demonstrating higher, earlier exploitative performance by frontier LLMs, (2) showing AlphaEvolve can produce simple, interpretable programs that match predictive performance and reveal that frontier LLMs maintain more complex opponent models, and (3) providing a framework to analyze structural differences between human and LLM strategic behavior that extends beyond modality-specific statistics. The findings illuminate how LLMs may surpass humans in strategic reasoning in IRPS while highlighting limitations in long-horizon reasoning for certain models, informing both behavioral science and AI governance in interactive settings.
Abstract
As Large Language Models (LLMs) are increasingly deployed in social and strategic scenarios, it becomes critical to understand where and why their behavior diverges from that of humans. While behavioral game theory (BGT) provides a framework for analyzing behavior, existing models do not fully capture the idiosyncratic behavior of humans or black-box, non-human agents like LLMs. We employ AlphaEvolve, a cutting-edge program discovery tool, to directly discover interpretable models of human and LLM behavior from data, thereby enabling open-ended discovery of structural factors driving human and LLM behavior. Our analysis on iterated rock-paper-scissors reveals that frontier LLMs can be capable of deeper strategic behavior than humans. These results provide a foundation for understanding structural differences driving differences in human and LLM behavior in strategic interactions.
