Table of Contents
Fetching ...

How Well Can Generative Adversarial Networks Learn Densities: A Nonparametric View

Tengyuan Liang

TL;DR

This work frames GANs as nonparametric density estimators and derives rates of convergence that adapt to the smoothness of the target density. By introducing a regularized/smoothed empirical measure, the authors obtain a rate of $n^{-(\alpha+\beta)/(2(\alpha+\beta)+d)}$ for the excess GAN risk, and prove a near-matching minimax lower bound, showing near-optimality in high dimensions. They extend the theory to neural-network realizations, demonstrating that deep ReLU discriminators and generators can achieve these rates under suitable approximation properties, with explicit improvements for deeper discriminators. A key byproduct is improved generalization bounds for GANs with deep discriminators, clarifying how smoothness and architecture impact learning performance in density estimation. Overall, the paper provides a quantitative, nonparametric foundation for understanding how well GANs can learn a wide class of densities across different evaluation metrics.

Abstract

We study in this paper the rate of convergence for learning densities under the Generative Adversarial Networks (GAN) framework, borrowing insights from nonparametric statistics. We introduce an improved GAN estimator that achieves a faster rate, through simultaneously leveraging the level of smoothness in the target density and the evaluation metric, which in theory remedies the mode collapse problem reported in the literature. A minimax lower bound is constructed to show that when the dimension is large, the exponent in the rate for the new GAN estimator is near optimal. One can view our results as answering in a quantitative way how well GAN learns a wide range of densities with different smoothness properties, under a hierarchy of evaluation metrics. As a byproduct, we also obtain improved generalization bounds for GAN with deeper ReLU discriminator network.

How Well Can Generative Adversarial Networks Learn Densities: A Nonparametric View

TL;DR

This work frames GANs as nonparametric density estimators and derives rates of convergence that adapt to the smoothness of the target density. By introducing a regularized/smoothed empirical measure, the authors obtain a rate of for the excess GAN risk, and prove a near-matching minimax lower bound, showing near-optimality in high dimensions. They extend the theory to neural-network realizations, demonstrating that deep ReLU discriminators and generators can achieve these rates under suitable approximation properties, with explicit improvements for deeper discriminators. A key byproduct is improved generalization bounds for GANs with deep discriminators, clarifying how smoothness and architecture impact learning performance in density estimation. Overall, the paper provides a quantitative, nonparametric foundation for understanding how well GANs can learn a wide class of densities across different evaluation metrics.

Abstract

We study in this paper the rate of convergence for learning densities under the Generative Adversarial Networks (GAN) framework, borrowing insights from nonparametric statistics. We introduce an improved GAN estimator that achieves a faster rate, through simultaneously leveraging the level of smoothness in the target density and the evaluation metric, which in theory remedies the mode collapse problem reported in the literature. A minimax lower bound is constructed to show that when the dimension is large, the exponent in the rate for the new GAN estimator is near optimal. One can view our results as answering in a quantitative way how well GAN learns a wide range of densities with different smoothness properties, under a hierarchy of evaluation metrics. As a byproduct, we also obtain improved generalization bounds for GAN with deeper ReLU discriminator network.

Paper Structure

This paper contains 12 sections, 10 theorems, 92 equations.

Key Result

Theorem 2.1

Let $\mathcal{F}$ be any critic function class. Denote $\mu_n$ as the solution (w.r.t. the empirical estimate $\nu_n$) to GAN with generator $\mu_G$ and discriminator $\mathcal{F}_D$ Then the following decompositions hold for any distribution $\nu$,

Theorems & Definitions (25)

  • Definition 1: Sobolev space: $k \in \mathbb{N}$
  • Definition 2: Sobolev space: $\alpha \in \mathbb{R}$
  • Definition 3: Sobolev ellipsoid
  • Theorem 2.1: Oracle inequality for GAN
  • Lemma 2.1: Arbitrary density
  • Corollary 2.1: Rates for arbitrary density
  • Theorem 2.2: Nonparametric estimation with GAN framework
  • Remark 2.1
  • Corollary 2.2
  • Theorem 2.3: Minimax lower bound
  • ...and 15 more