PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium

Shihong Ding; Hanze Dong; Cong Fang; Zhouchen Lin; Tong Zhang

PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium

Shihong Ding, Hanze Dong, Cong Fang, Zhouchen Lin, Tong Zhang

TL;DR

PAPAL is the first implementable particle-based algorithm accompanied by non-asymptotic quantitative convergence results, running time, and sample complexity guarantees and is the first implementable particle-based algorithm accompanied by non-asymptotic quantitative convergence results, running time and sample complexity guarantees.

Abstract

We consider the non-convex non-concave objective function in two-player zero-sum continuous games. The existence of pure Nash equilibrium requires stringent conditions, posing a major challenge for this problem. To circumvent this difficulty, we examine the problem of identifying a mixed Nash equilibrium, where strategies are randomized and characterized by probability distributions over continuous domains. To this end, we propose PArticle-based Primal-dual ALgorithm (PAPAL) tailored for a weakly entropy-regularized min-max optimization over probability distributions. This algorithm employs the stochastic movements of particles to represent the updates of random strategies for the $ε$-mixed Nash equilibrium. We offer a comprehensive convergence analysis of the proposed algorithm, demonstrating its effectiveness. In contrast to prior research that attempted to update particle importance without movements, PAPAL is the first implementable particle-based algorithm accompanied by non-asymptotic quantitative convergence results, running time, and sample complexity guarantees. Our framework contributes novel insights into the particle-based algorithms for continuous min-max optimization in the general non-convex non-concave setting.

PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium

TL;DR

Abstract

-mixed Nash equilibrium. We offer a comprehensive convergence analysis of the proposed algorithm, demonstrating its effectiveness. In contrast to prior research that attempted to update particle importance without movements, PAPAL is the first implementable particle-based algorithm accompanied by non-asymptotic quantitative convergence results, running time, and sample complexity guarantees. Our framework contributes novel insights into the particle-based algorithms for continuous min-max optimization in the general non-convex non-concave setting.

Paper Structure (39 sections, 27 theorems, 144 equations, 2 figures, 3 algorithms)

This paper contains 39 sections, 27 theorems, 144 equations, 2 figures, 3 algorithms.

Introduction
Contributions.
Related Work
Min-Max Optimization.
Mixed Nash Equilibrium.
Mean-field Analysis.
Preliminary
Notation.
Problem Setup.
Assumptions.
Primal-Dual Algorithms for Min-Max Optimization.
Unadjusted Langevin Algorithm.
PAPAL: PArticle-based Primal-dual ALgorithm
Exact Primal-dual Method
Recursive Energy Function
...and 24 more sections

Key Result

Lemma 1

Let $\lambda>0$ be a positive real number and $\tilde{l}(\theta), l(\theta)$ be bounded continuous functions. Consider a probability density $\Bar{p}(\theta)\propto\exp\left(-\tilde{l}(\theta)\right)$, then $p(\theta)\propto\exp\left(-\frac{1}{\lambda+\tau}l(\theta)-\frac{\tau}{\lambda+\tau}\tilde{l

Figures (2)

Figure 1: Comparison between weight-driven and particle-based algorithms.
Figure 2: Illustration of the sample efficiency on Generative Adversarial Networks. ($x$-axis is the dimension; $y$-axis is the KL divergence to measure the distance between real and fake distribution; $n$ denotes the number of particles)

Theorems & Definitions (32)

Definition 1: $\epsilon$-MNE
Definition 2
Lemma 1
Remark 1
Remark 2
Theorem 1
Lemma 2
Theorem 2
Corollary 1
Lemma 3
...and 22 more

PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium

TL;DR

Abstract

PAPAL: A Provable PArticle-based Primal-Dual ALgorithm for Mixed Nash Equilibrium

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (32)