On the Statistical Capacity of Deep Generative Models
Edric Tam, David B. Dunson
TL;DR
The paper analyzes the statistical capacity of deep generative models by deriving non-asymptotic, dimension-free concentration bounds for the outputs of Lipschitz push-forwards under various latent distributions. It shows that with Gaussian latents, the generator error is sub-Gaussian, implying light tails and a lack of universality for heavy-tailed targets; these results extend to log-concave latents, strongly log-concave latents, and latents on manifolds with positive Ricci curvature, and to diffusion models via a reduction to a single Lipschitz transform. The theoretical guarantees are complemented by simulations and financial data illustrating the practical limitation: such models underrepresent tail uncertainty, which matters for anomaly detection and risk-sensitive tasks. The findings motivate exploring richer latent priors or non-Lipschitz generative mechanisms to better capture heavy-tailed phenomena in real data.
Abstract
Deep generative models are routinely used in generating samples from complex, high-dimensional distributions. Despite their apparent successes, their statistical properties are not well understood. A common assumption is that with enough training data and sufficiently large neural networks, deep generative model samples will have arbitrarily small errors in sampling from any continuous target distribution. We set up a unifying framework that debunks this belief. We demonstrate that broad classes of deep generative models, including variational autoencoders and generative adversarial networks, are not universal generators. Under the predominant case of Gaussian latent variables, these models can only generate concentrated samples that exhibit light tails. Using tools from concentration of measure and convex geometry, we give analogous results for more general log-concave and strongly log-concave latent variable distributions. We extend our results to diffusion models via a reduction argument. We use the Gromov--Levy inequality to give similar guarantees when the latent variables lie on manifolds with positive Ricci curvature. These results shed light on the limited capacity of common deep generative models to handle heavy tails. We illustrate the empirical relevance of our work with simulations and financial data.
