Deep Generative Models: Complexity, Dimensionality, and Approximation
Kevin Wang, Hongqian Niu, Yixin Wang, Didong Li
TL;DR
This work provides a theoretical framework showing that deep generative models can approximate distributions on $d$-dimensional Riemannian manifolds from inputs of arbitrary dimension $m\ge 1$, even when $m<d$, by leveraging space-filling curve concepts. It establishes a concrete complexity-accuracy trade-off: as input dimension decreases below the manifold dimension, the required network width grows super-exponentially, revealing a fundamental limitation and guiding practical choices for input dimension. The main results extend previous $d+1$-dimensional constructions to arbitrary $m$, with population and empirical guarantees in the Wasserstein metric, and they are complemented by toy simulations illustrating space-filling behavior. The findings have implications for understanding the practical success of generative models and inform architecture design and dimension estimation in high-dimensional data settings.
Abstract
Generative networks have shown remarkable success in learning complex data distributions, particularly in generating high-dimensional data from lower-dimensional inputs. While this capability is well-documented empirically, its theoretical underpinning remains unclear. One common theoretical explanation appeals to the widely accepted manifold hypothesis, which suggests that many real-world datasets, such as images and signals, often possess intrinsic low-dimensional geometric structures. Under this manifold hypothesis, it is widely believed that to approximate a distribution on a $d$-dimensional Riemannian manifold, the latent dimension needs to be at least $d$ or $d+1$. In this work, we show that this requirement on the latent dimension is not necessary by demonstrating that generative networks can approximate distributions on $d$-dimensional Riemannian manifolds from inputs of any arbitrary dimension, even lower than $d$, taking inspiration from the concept of space-filling curves. This approach, in turn, leads to a super-exponential complexity bound of the deep neural networks through expanded neurons. Our findings thus challenge the conventional belief on the relationship between input dimensionality and the ability of generative networks to model data distributions. This novel insight not only corroborates the practical effectiveness of generative networks in handling complex data structures, but also underscores a critical trade-off between approximation error, dimensionality, and model complexity.
