Nash Equilibrium and Learning Dynamics in Three-Player Matching $m$-Action Games
Yuma Fujimoto, Kaito Ariu, Kenshi Abe
TL;DR
This work extends the classic two-player Matching Pennies to a three-player, multi-action framework ($m$-3MA) and provides a complete Nash equilibrium analysis, showing symmetry $x^*=y^*=z^*$ and a spectrum of equilibria (uniform, pure, double-roots). It then studies learning dynamics under continuous-time Follow the Regularized Leader (FTRL) with entropic and Euclidean regularizers, introducing the Lyapunov-like measure $V$ to capture synchronization among players. The dynamics, governed by three-parameter interactions $(oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{}}}}}}}$), exhibit cycling, convergence to two-action equilibria, or heteroclinic cycles depending on the signs of $oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{}}}}}}}$ and $oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{oldsymbol{}}}}}}}$, with $eta$ shaping rotational aspects. The results illuminate how triadic interactions shape global learning behavior and point to future work on memory and last-iterate convergence in multi-agent games.
Abstract
Learning in games discusses the processes where multiple players learn their optimal strategies through the repetition of game plays. The dynamics of learning between two players in zero-sum games, such as Matching Pennies, where their benefits are competitive, have already been well analyzed. However, it is still unexplored and challenging to analyze the dynamics of learning among three players. In this study, we formulate a minimalistic game where three players compete to match their actions with one another. Although interaction among three players diversifies and complicates the Nash equilibria, we fully analyze the equilibria. We also discuss the dynamics of learning based on some famous algorithms categorized into Follow the Regularized Leader. From both theoretical and experimental aspects, we characterize the dynamics by categorizing three-player interactions into three forces to synchronize their actions, switch their actions rotationally, and seek competition.
