Beyond Symmetry in Repeated Games with Restarts
Henry Fleischmann, Kiriaki Fragkia, Ratip Emin Berker
TL;DR
The paper addresses cooperation in anonymous pairwise interactions by extending repeated games with restarts to asymmetric settings, where agents may rematch with new partners after deviations. It introduces hazing periods and goal sequences to sustain stable equilibria, and develops a formal framework to compare these sequences via Pareto-optimality, welfare maximization, and limit-utility fairness. The authors prove that computing minimum hazing sequences is (weakly) NP-hard, but show a pseudo-polynomial-time dynamic programming algorithm when the goal sequence maximizes social welfare; they also extend the model to random role reassignment and discuss existence results in various game classes. This work advances the design and analysis of cooperative protocols under anonymity, revealing both the potential gains from asymmetric strategies and the computational barriers to achieving them.
Abstract
Infinitely repeated games support equilibrium concepts beyond those present in one-shot games (e.g., cooperation in the prisoner's dilemma). Nonetheless, repeated games fail to capture our real-world intuition for settings with many anonymous agents interacting in pairs. Repeated games with restarts, introduced by Berker and Conitzer [IJCAI '24], address this concern by giving players the option to restart the game with someone new whenever their partner deviates from an agreed-upon sequence of actions. In their work, they studied symmetric games with symmetric strategies. We significantly extend these results, introducing and analyzing more general notions of equilibria in asymmetric games with restarts. We characterize which goal strategies players can be incentivized to play in equilibrium, and we consider the computational problem of finding such sequences of actions with minimal cost for the agents. We show that this problem is NP-hard in general. However, when the goal sequence maximizes social welfare, we give a pseudo-polynomial time algorithm.
