Cheap Talking Algorithms
Daniele Condorelli, Massimiliano Furlan
TL;DR
This paper investigates how simple memoryless reinforcement-learning agents play the Crawford–Sobel cheap-talk game in a static, large-population setting. It shows that agents converge to Bayes–Nash equilibria with substantial information transmission when bias is low, and that informativeness declines as bias increases, with equilibria near Pareto-optimal or second-best predictions at intermediate bias. The results hold across a range of hyperparameters and game forms, and the equilibrium selection is largely governed by monotone partitional equilibria, transitioning from the most informative to less informative as $b$ grows. The work has implications for AI agents in strategic settings, suggesting that communication can persist and even shape market-like outcomes, and it outlines several avenues for extending the analysis to population dynamics, networks, and human–AI interactions.
Abstract
We simulate behaviour of two independent reinforcement learning algorithms playing the Crawford and Sobel (1982) game of strategic information transmission. We adopt memoryless algorithms to capture learning in a static game where a large population interacts anonymously. We show that sender and receiver converge to Nash equilibrium play. The level of informativeness of the sender's cheap talk decreases as the bias increases and, at intermediate level of the bias, it matches the level predicted by the Pareto optimal equilibrium or by the second best one. Conclusions are robust to alternative specifications of the learning hyperparameters and of the game.
