Mirror Descent-Ascent for mean-field min-max problems

Razvan-Andrei Lascu; Mateusz B. Majka; Łukasz Szpruch

Mirror Descent-Ascent for mean-field min-max problems

Razvan-Andrei Lascu, Mateusz B. Majka, Łukasz Szpruch

TL;DR

The work advances mean-field min–max optimization by casting training tasks like GANs into convex-concave games over probability measures and applying infinite-dimensional mirror descent-ascent. By distinguishing simultaneous and sequential playing, the authors prove time-averaged convergence to mixed Nash equilibria with NI errors decaying as $O(N^{-1/2})$ and $O(N^{-2/3})$, respectively, under appropriate relative smoothness and regularity assumptions. A key novelty is the duality-based analysis for the sequential scheme, which reveals faster convergence through control of dual Bregman divergences and commutators. The results generalize finite-dimensional rates to the mean-field setting and align with practical training practices that alternately optimize generator and discriminator components in GANs. Overall, the paper provides rigorous rates, a duality toolkit, and a principled justification for alternating updates in mean-field min–max problems.

Abstract

We study two variants of the mirror descent-ascent algorithm for solving min-max problems on the space of measures: simultaneous and sequential. We work under assumptions of convexity-concavity and relative smoothness of the payoff function with respect to a suitable Bregman divergence, defined on the space of measures via flat derivatives. We show that the convergence rates to mixed Nash equilibria, measured in the Nikaidò-Isoda error, are of order $\mathcal{O}\left(N^{-1/2}\right)$ and $\mathcal{O}\left(N^{-2/3}\right)$ for the simultaneous and sequential schemes, respectively, which is in line with the state-of-the-art results for related finite-dimensional algorithms.

Mirror Descent-Ascent for mean-field min-max problems

TL;DR

and

, respectively, under appropriate relative smoothness and regularity assumptions. A key novelty is the duality-based analysis for the sequential scheme, which reveals faster convergence through control of dual Bregman divergences and commutators. The results generalize finite-dimensional rates to the mean-field setting and align with practical training practices that alternately optimize generator and discriminator components in GANs. Overall, the paper provides rigorous rates, a duality toolkit, and a principled justification for alternating updates in mean-field min–max problems.

Abstract

and

for the simultaneous and sequential schemes, respectively, which is in line with the state-of-the-art results for related finite-dimensional algorithms.

Paper Structure (22 sections, 14 theorems, 152 equations)

This paper contains 22 sections, 14 theorems, 152 equations.

Introduction
Notation and setup
Motivating example: Training of GANs
Simultaneous and sequential MDA
Related works
Our contribution
Main results
Convergence of the simultaneous MDA scheme 1.6
Convergence of the sequential MDA scheme 1.7
Proofs of Theorem \ref{['thm: conv-sim-bregman']} and Theorem \ref{['thm:conv-alt-bregman']}
Proof of Theorem \ref{['thm: conv-sim-bregman']}
Proof of Theorem \ref{['thm:conv-alt-bregman']}
Proofs of additional results
Proof of Lemma \ref{['lemma:tau2-bound-primal']}
Proof of Lemma \ref{['lemma:Bregman-prox-ineq']}
...and 7 more sections

Key Result

Lemma 2.4

Let Assumption assumption:assump-h, def:relative-smoothness and assumption:F-lip hold. Suppose that $\tau L \leq \frac{1}{2},$ with $L \coloneqq \max\{L_{\nu}, L_{\mu}\}.$ Then, for both schemes eq:mirror-sim-explicit and eq:mirror-alt, it holds, for all $n \geq 0,$ that

Theorems & Definitions (47)

Definition 1.3: Bregman divergence
Example 1.4: Relative entropy
Remark 2.2
Remark 2.3
Lemma 2.4
Lemma 2.5: Three-point inequality
Theorem 2.6: Convergence of the simultaneous MDA scheme \ref{['eq:mirror-sim-explicit']}
Remark 2.7
Definition 2.8: Convex conjugate
Corollary 2.11
...and 37 more

Mirror Descent-Ascent for mean-field min-max problems

TL;DR

Abstract

Mirror Descent-Ascent for mean-field min-max problems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (47)