Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition
Yutong Chao, Jalal Etesami
TL;DR
The paper addresses finding Nash equilibria in nonconvex general-sum games by introducing the $n$-sided PL condition, which extends gradient-dominance concepts to multi-coordinate settings and ensures that the stationary-point set coincides with the NE set. It analyzes block-coordinate descent (BCD) under this condition and proposes adaptive variants, IA-RBCD and A-RBCD, that leverage the gradient of the aggregate best-response $G_F$ to drive convergence to NE, even when NE are nonunique and the objectives are not globally lower bounded. The authors establish linear convergence under an additional cross-term bound and provide general sublinear rates when a $(\\theta,\\nu)$-PL condition holds for the function $F-G_F$, along with practical schemes to approximate best responses. They validate the approach on potential and general-sum problems, including Cournot competition and infinite-horizon LQ games, demonstrating improved convergence over traditional BCD baselines and robustness to strict saddle points. The results advance theoretical guarantees for gradient-based multi-agent optimization beyond two-block or convex settings and offer algorithmic tools for reliably locating NE in complex nonconvex games with practical impact in economics and control.
Abstract
We consider the problem of finding a Nash equilibrium (NE) in a general-sum game, where player $i$'s objective is $f_i(x)=f_i(x_1,...,x_n)$, with $x_j\in\mathbb{R}^{d_j}$ denoting the strategy variables of player $j$. Our focus is on investigating first-order gradient-based algorithms and their variations, such as the block coordinate descent (BCD) algorithm, for tackling this problem. We introduce a set of conditions, called the $n$-sided PL condition, which extends the well-established gradient dominance condition a.k.a Polyak-Łojasiewicz (PL) condition and the concept of multi-convexity. This condition, satisfied by various classes of non-convex functions, allows us to analyze the convergence of various gradient descent (GD) algorithms. Moreover, our study delves into scenarios where the standard gradient descent methods fail to converge to NE. In such cases, we propose adapted variants of GD that converge towards NE and analyze their convergence rates. Finally, we evaluate the performance of the proposed algorithms through several experiments.
