Approximation and Convergence Properties of Generative Adversarial Learning
Shuang Liu, Olivier Bousquet, Kamalika Chaudhuri
TL;DR
This paper investigates how discriminator restriction and objective-function choice affect GANs' ability to approximate a target distribution. It introduces the notion of adversarial divergences to unify various GAN objectives and distinguishes strict adversarial divergences, which ensure unique minimizers. It proves that restricting discriminators yields generalized moment matching with the target and that neural-network discriminators preserve this property; it also shows that convergence in a strict adversarial divergence implies weak convergence to the target, with Wasserstein distance exemplifying the weakest such divergence. The results provide a broad, principled framework connecting multiple GAN variants and clarifying the convergence behavior under different divergences.
Abstract
Generative adversarial networks (GAN) approximate a target data distribution by jointly optimizing an objective function through a "two-player game" between a generator and a discriminator. Despite their empirical success, however, two very basic questions on how well they can approximate the target distribution remain unanswered. First, it is not known how restricting the discriminator family affects the approximation quality. Second, while a number of different objective functions have been proposed, we do not understand when convergence to the global minima of the objective function leads to convergence to the target distribution under various notions of distributional convergence. In this paper, we address these questions in a broad and unified setting by defining a notion of adversarial divergences that includes a number of recently proposed objective functions. We show that if the objective function is an adversarial divergence with some additional conditions, then using a restricted discriminator family has a moment-matching effect. Additionally, we show that for objective functions that are strict adversarial divergences, convergence in the objective function implies weak convergence, thus generalizing previous results.
