Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition

Yutong Chao; Jalal Etesami

Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition

Yutong Chao, Jalal Etesami

TL;DR

The paper addresses finding Nash equilibria in nonconvex general-sum games by introducing the $n$-sided PL condition, which extends gradient-dominance concepts to multi-coordinate settings and ensures that the stationary-point set coincides with the NE set. It analyzes block-coordinate descent (BCD) under this condition and proposes adaptive variants, IA-RBCD and A-RBCD, that leverage the gradient of the aggregate best-response $G_F$ to drive convergence to NE, even when NE are nonunique and the objectives are not globally lower bounded. The authors establish linear convergence under an additional cross-term bound and provide general sublinear rates when a $(\\theta,\\nu)$-PL condition holds for the function $F-G_F$, along with practical schemes to approximate best responses. They validate the approach on potential and general-sum problems, including Cournot competition and infinite-horizon LQ games, demonstrating improved convergence over traditional BCD baselines and robustness to strict saddle points. The results advance theoretical guarantees for gradient-based multi-agent optimization beyond two-block or convex settings and offer algorithmic tools for reliably locating NE in complex nonconvex games with practical impact in economics and control.

Abstract

We consider the problem of finding a Nash equilibrium (NE) in a general-sum game, where player $i$'s objective is $f_i(x)=f_i(x_1,...,x_n)$, with $x_j\in\mathbb{R}^{d_j}$ denoting the strategy variables of player $j$. Our focus is on investigating first-order gradient-based algorithms and their variations, such as the block coordinate descent (BCD) algorithm, for tackling this problem. We introduce a set of conditions, called the $n$-sided PL condition, which extends the well-established gradient dominance condition a.k.a Polyak-Łojasiewicz (PL) condition and the concept of multi-convexity. This condition, satisfied by various classes of non-convex functions, allows us to analyze the convergence of various gradient descent (GD) algorithms. Moreover, our study delves into scenarios where the standard gradient descent methods fail to converge to NE. In such cases, we propose adapted variants of GD that converge towards NE and analyze their convergence rates. Finally, we evaluate the performance of the proposed algorithms through several experiments.

Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition

TL;DR

The paper addresses finding Nash equilibria in nonconvex general-sum games by introducing the

-sided PL condition, which extends gradient-dominance concepts to multi-coordinate settings and ensures that the stationary-point set coincides with the NE set. It analyzes block-coordinate descent (BCD) under this condition and proposes adaptive variants, IA-RBCD and A-RBCD, that leverage the gradient of the aggregate best-response

to drive convergence to NE, even when NE are nonunique and the objectives are not globally lower bounded. The authors establish linear convergence under an additional cross-term bound and provide general sublinear rates when a

-PL condition holds for the function

, along with practical schemes to approximate best responses. They validate the approach on potential and general-sum problems, including Cournot competition and infinite-horizon LQ games, demonstrating improved convergence over traditional BCD baselines and robustness to strict saddle points. The results advance theoretical guarantees for gradient-based multi-agent optimization beyond two-block or convex settings and offer algorithmic tools for reliably locating NE in complex nonconvex games with practical impact in economics and control.

Abstract

We consider the problem of finding a Nash equilibrium (NE) in a general-sum game, where player

's objective is

, with

denoting the strategy variables of player

. Our focus is on investigating first-order gradient-based algorithms and their variations, such as the block coordinate descent (BCD) algorithm, for tackling this problem. We introduce a set of conditions, called the

-sided PL condition, which extends the well-established gradient dominance condition a.k.a Polyak-Łojasiewicz (PL) condition and the concept of multi-convexity. This condition, satisfied by various classes of non-convex functions, allows us to analyze the convergence of various gradient descent (GD) algorithms. Moreover, our study delves into scenarios where the standard gradient descent methods fail to converge to NE. In such cases, we propose adapted variants of GD that converge towards NE and analyze their convergence rates. Finally, we evaluate the performance of the proposed algorithms through several experiments.

Paper Structure (37 sections, 14 theorems, 58 equations, 7 figures, 5 algorithms)

This paper contains 37 sections, 14 theorems, 58 equations, 7 figures, 5 algorithms.

Introduction
Related Work
$n$-sided PL Condition
Definitions and Assumptions
Algorithms and Convergence Analysis
Potential setting:
General-sum setting:
Convergence of random BCD under an additional assumption
Adaptive random BCD with the knowledge of exact best responses
Adaptive random BCD without the knowledge of exact best responses
Applications
Benchmarks
Function with only strict saddle point:
Cournot Competition:
Infinite Horizon $n$-player Linear-quadratic (LQ) Game:
...and 22 more sections

Key Result

Lemma 2.5

If functions $\{f_i(x_1,...,x_n)\}$ satisfy the $n$-sided PL condition, then $\mathcal{S}_0(f_1,\cdots,f_n)=\mathcal{N}(f_1,\cdots,f_n)$.

Figures (7)

Figure 1: Left is function $f^{(1)}(x_1,x_2)$ and right is $f^{(2)}(x_1,x_2)$.
Figure 2: The result of R-BCD algorithm applied to potential setting with functions $f^{(3)}(x_1,x_2)$ and $f^{(4)}(x_1,x_2)$. The y-axis is in log scale, thus the R-BCD demonstrates linear convergence for $f^{(4)}$.
Figure 3: The result of R-BCD algorithm applied to functions $\{f^{(5)}_{1},f^{(5)}_{2}\}$ and $\{f^{(6)}_{1},f^{(6)}_{2}\}$. The y-axis is in log scale, thus the BCD shows linear convergence in the second game.
Figure 4: Performances of A-RBCD, BM1, and BM2 applied to functions $f_1$ and $f_2$.
Figure 5: (a) Performances of A-RBCD, BM1, and BM2 applied to the Cournot model with $P(Q)=a-bQ$; (b) with $P(Q)=a-bQ^2$.
...and 2 more figures

Theorems & Definitions (19)

Definition 2.2: Nash Equilibrium (NE)
Definition 2.3: $\varepsilon$-Partial Stationary point
Definition 2.4: $n$-sided PL Condition
Lemma 2.5
Lemma 3.1
Lemma 3.2
Theorem 3.3
Definition 3.4: $(\theta,\nu)$-PL condition
Theorem 3.5
Theorem 3.6
...and 9 more

Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition

TL;DR

Abstract

Global Convergence to Nash Equilibrium in Nonconvex General-Sum Games under the $n$-Sided PL Condition

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (19)