Fisher GAN
Youssef Mroueh, Tom Sercu
TL;DR
The paper tackles GAN training instability by introducing Fisher IPM, a scale-invariant distribution distance achieved by constraining the critic's second-order moments in a data-dependent way.By interpreting the neural-network-parameterized critic as whitening mean embeddings, Fisher IPM yields a Mahalanobis-distance-based discrepancy that remains computationally efficient and avoids aggressive weight clipping or costly gradient penalties.The authors prove that, with full function capacity, Fisher IPM equals the Chi-squared distance, derive a practical ALM-based optimization algorithm for training, and provide generalization bounds for the learned critic; they validate the approach with stable training, fast convergence, and competitive semi-supervised results on standard benchmarks.
Abstract
Generative Adversarial Networks (GANs) are powerful models for learning complex distributions. Stable training of GANs has been addressed in many recent works which explore different metrics between distributions. In this paper we introduce Fisher GAN which fits within the Integral Probability Metrics (IPM) framework for training GANs. Fisher GAN defines a critic with a data dependent constraint on its second order moments. We show in this paper that Fisher GAN allows for stable and time efficient training that does not compromise the capacity of the critic, and does not need data independent constraints such as weight clipping. We analyze our Fisher IPM theoretically and provide an algorithm based on Augmented Lagrangian for Fisher GAN. We validate our claims on both image sample generation and semi-supervised classification using Fisher GAN.
