Pointwise Convergence in Games with Conflicting Interest
Nanxiang Zhou, Jing Dong, Baoxiang Wang
TL;DR
The paper develops non-negative weighted regret to unify analyses of harmonic and zero-sum games, enabling a common framework for learning dynamics in conflicting-interest settings. It analyzes optimistic no-regret algorithms, specifically OMD and OFTRL, proving they reach an $ε$-approximate Nash equilibrium in $O(1/ε^2)$ iterations and exhibit pointwise convergence to NE when the NE set is finite. The results extend to corrupted dynamics with finite deviations, retaining convergence guarantees and showing equivalence between corrupted OMD and OFTRL. Empirical evaluations on Matching Pennies and harmonic games corroborate the theory, illustrating practical convergence behavior and robustness of the proposed methods.
Abstract
In this work, we introduce the concept of non-negative weighted regret, an extension of non-negative regret \cite{anagnostides2022last} in games. Investigating games with non-negative weighted regret helps us understand games with conflicting interests, including harmonic games and important classes of zero-sum games. We show that optimistic variants of classical no-regret learning algorithms, namely optimistic mirror descent (OMD) and optimistic follow the regularized leader (OFTRL), reach an $ε$-approximate Nash equilibrium within $O(1/ε^2)$ iterations. Consequently, they guarantee pointwise convergence to a Nash equilibrium if the game has only finitely many Nash equilibria. These algorithms are robust in the sense that convergence holds even if the players deviate from the algorithms, provided the deviations are finite. Our theoretical findings are supported by empirical evaluations of OMD and OFTRL on the game of matching pennies and harmonic game instances.
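To make the optimistic update concrete, the following is a minimal sketch of OMD on matching pennies. It is not the authors' experimental code. It assumes an entropy regularizer (so the update becomes optimistic multiplicative weights), a hypothetical step size `eta = 0.1`, the payoff convention that the row player maximizes $x^\top A y$ while the column player minimizes it, and arbitrary interior starting strategies. The helper `nash_gap` is an illustrative name that measures the largest gain either player can obtain by deviating unilaterally.

```python
import numpy as np


def optimistic_mwu(A, x0, y0, eta=0.1, steps=5000):
    """Optimistic multiplicative weights on the zero-sum game x^T A y.

    Assumption: entropy regularizer, so OMD reduces to a multiplicative
    update that uses the prediction 2*u_t - u_{t-1}.
    """
    x = np.asarray(x0, dtype=float)
    y = np.asarray(y0, dtype=float)
    # Initialise the "previous" utilities with the current ones.
    ux_prev, uy_prev = A @ y, -(A.T @ x)
    for _ in range(steps):
        # Utility vectors: row player maximises, column player minimises.
        ux, uy = A @ y, -(A.T @ x)
        # Optimistic step: latest utility counted twice, previous one removed.
        x = x * np.exp(eta * (2.0 * ux - ux_prev))
        y = y * np.exp(eta * (2.0 * uy - uy_prev))
        x /= x.sum()
        y /= y.sum()
        ux_prev, uy_prev = ux, uy
    return x, y


def nash_gap(A, x, y):
    """Largest unilateral improvement available to either player."""
    value = x @ A @ y
    row_gain = (A @ y).max() - value    # row player's best-response gain
    col_gain = value - (A.T @ x).min()  # column player's best-response gain
    return max(row_gain, col_gain)


# Matching pennies: the unique Nash equilibrium is uniform play.
A = np.array([[1.0, -1.0], [-1.0, 1.0]])
x, y = optimistic_mwu(A, x0=[0.8, 0.2], y0=[0.3, 0.7])
print("x =", x, "y =", y, "gap =", nash_gap(A, x, y))
```

Since matching pennies has a single Nash equilibrium, the final iterates themselves (not only their time averages) should approach uniform play in this sketch, which is the kind of pointwise behavior the abstract describes.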
