Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium
Yuma Fujimoto, Kaito Ariu, Kenshi Abe
TL;DR
This work studies learning in periodic zero-sum games where the Nash equilibrium moves over time. By casting gradient-descent-ascent as a linear dynamical system and analyzing its eigenstructure, the authors show a synchronization phenomenon: when the average learning speed, captured by the eigenvalue $\alpha$, aligns with the game frequency $\omega$ (i.e., $\alpha/\omega=1$), the time-average of players’ strategies diverges away from the time-average Nash equilibrium; otherwise the dynamics form complex cycles but their time-average converges. The results are derived in a 2×2 setting via the eigenvalue invariant game and extended to general 2×2 smooth periodic games, with further experiments confirming robustness to higher action counts, boundary constraints, non-smooth waves, and polymatrix extensions. This reveals a universal resonance-driven mechanism in learning dynamics under time-varying environments, with implications for tracking evolving equilibria and designing algorithms robust to seasonal or cyclical changes.
Abstract
Learning in zero-sum games studies a situation where multiple agents competitively learn their strategy. In such multi-agent learning, we often see that the strategies cycle around their optimum, i.e., Nash equilibrium. When a game periodically varies (called a ``periodic'' game), however, the Nash equilibrium moves generically. How learning dynamics behave in such periodic games is of interest but still unclear. Interestingly, we discover that the behavior is highly dependent on the relationship between the two speeds at which the game changes and at which players learn. We observe that when these two speeds synchronize, the learning dynamics diverge, and their time-average does not converge. Otherwise, the learning dynamics draw complicated cycles, but their time-average converges. Under some assumptions introduced for the dynamical systems analysis, we prove that this behavior occurs. Furthermore, our experiments observe this behavior even if removing these assumptions. This study discovers a novel phenomenon, i.e., synchronization, and gains insight widely applicable to learning in periodic games.
