Is Generator Conditioning Causally Related to GAN Performance?

Augustus Odena; Jacob Buckman; Catherine Olsson; Tom B. Brown; Christopher Olah; Colin Raffel; Ian Goodfellow

Is Generator Conditioning Causally Related to GAN Performance?

Augustus Odena, Jacob Buckman, Catherine Olsson, Tom B. Brown, Christopher Olah, Colin Raffel, Ian Goodfellow

TL;DR

The paper shows that the generator Jacobian in GANs becomes ill-conditioned early in training and that this conditioning strongly predicts common quality metrics like Inception Score and FID. It tests causality by introducing Jacobian Clamping, a simple regularizer that constrains the Jacobian's spectrum, and demonstrates improvements in mean scores and a dramatic reduction in inter-run variance across multiple datasets. The results argue for a causal link between Jacobian conditioning and GAN performance and offer a practical technique to stabilize training and evaluation. Overall, the work provides both a diagnostic framework based on local geometry and a actionable method to enhance GAN reliability and efficiency.

Abstract

Recent work (Pennington et al, 2017) suggests that controlling the entire distribution of Jacobian singular values is an important design consideration in deep learning. Motivated by this, we study the distribution of singular values of the Jacobian of the generator in Generative Adversarial Networks (GANs). We find that this Jacobian generally becomes ill-conditioned at the beginning of training. Moreover, we find that the average (with z from p(z)) conditioning of the generator is highly predictive of two other ad-hoc metrics for measuring the 'quality' of trained GANs: the Inception Score and the Frechet Inception Distance (FID). We test the hypothesis that this relationship is causal by proposing a 'regularization' technique (called Jacobian Clamping) that softly penalizes the condition number of the generator Jacobian. Jacobian Clamping improves the mean Inception Score and the mean FID for GANs trained on several datasets. It also greatly reduces inter-run variance of the aforementioned scores, addressing (at least partially) one of the main criticisms of GANs.

Is Generator Conditioning Causally Related to GAN Performance?

TL;DR

Abstract

Is Generator Conditioning Causally Related to GAN Performance?

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)