A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

Neel Mishra; Bamdev Mishra; Pratik Jawanpuria; Pawan Kumar

A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

Neel Mishra, Bamdev Mishra, Pratik Jawanpuria, Pawan Kumar

TL;DR

This work addresses instability and inefficiency in training GANs under the min-max objective by introducing a Gauss-Newton preconditioned solver with Sherman-Morrison inversion. The method constructs a rank-one update to approximate the min-max Hessian and defines a fixed-point iteration with provable local convergence for $\lambda\in(0,1)$. Empirically, it yields high-fidelity and diverse generations across grayscale and color datasets, achieving the CIFAR10 inception score of $\text{IS}=5.82$ while maintaining runtimes comparable to first-order optimizers like Adam. The approach delivers a practically impactful balance of stability, speed, and image quality, demonstrated on MNIST, Fashion-MNIST, CIFAR10, FFHQ, and LSUN.

Abstract

A novel first-order method is proposed for training generative adversarial networks (GANs). It modifies the Gauss-Newton method to approximate the min-max Hessian and uses the Sherman-Morrison inversion formula to calculate the inverse. The method corresponds to a fixed-point method that ensures necessary contraction. To evaluate its effectiveness, numerical experiments are conducted on various datasets commonly used in image generation tasks, such as MNIST, Fashion MNIST, CIFAR10, FFHQ, and LSUN. Our method is capable of generating high-fidelity images with greater diversity across multiple datasets. It also achieves the highest inception score for CIFAR10 among all compared methods, including state-of-the-art second-order methods. Additionally, its execution time is comparable to that of first-order min-max methods.

A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

TL;DR

. Empirically, it yields high-fidelity and diverse generations across grayscale and color datasets, achieving the CIFAR10 inception score of

while maintaining runtimes comparable to first-order optimizers like Adam. The approach delivers a practically impactful balance of stability, speed, and image quality, demonstrated on MNIST, Fashion-MNIST, CIFAR10, FFHQ, and LSUN.

Abstract

Paper Structure (14 sections, 6 theorems, 17 equations, 6 figures, 6 tables, 2 algorithms)

This paper contains 14 sections, 6 theorems, 17 equations, 6 figures, 6 tables, 2 algorithms.

Introduction
The min-max problem in GAN
Related work on min-max solvers for GANs
Contributions of the paper:
Proposed solver
Fixed point iteration and convergence
Numerical results
Experimental setup
Image generation on grey scale images
Image generation on color images
Computational complexity and timing comparisons
Code Repository
Conclusion
Acknowledgment

Key Result

Lemma 1

For zero-sum games, $v'(p)$ is negative semi-definite if and only if $\nabla_{xx}^{2}f(x,y)$ is negative semi-definite and $\nabla_{yy}^{2}f(x,y)$ is positive semi-definite.

Figures (6)

Figure 1: Results for CIFAR10.
Figure 2: Results for CIFAR10.
Figure 3: Images generated for MNIST. Samples inside white box show mode-collapse.
Figure 4: Images generated for Fashion MNIST. Samples inside white box show mode-collapse.
Figure 5: Images generated by our method on LSUN-tower and LSUN-bridges.
...and 1 more figures

Theorems & Definitions (14)

Remark 1
Remark 2
Lemma 1
proof
Corollary 1
proof
Proposition 1
proof
Lemma 2
proof
...and 4 more

A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

TL;DR

Abstract

A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial Networks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (14)