Analysis of learning a flow-based generative model from limited sample complexity
Hugo Cui, Florent Krzakala, Eric Vanden-Eijnden, Lenka Zdeborová
TL;DR
This work provides a tight end-to-end asymptotic analysis of learning a flow-based generative model for a Gaussian mixture using a shallow two-layer autoencoder to parameterize the velocity field. It derives closed-form expressions for the learned velocity and its associated summary statistics, and shows that the generated mixture’s mean converges to the target mean at a rate of Θ(1/n), which is Bayes-optimal. The study reduces the high-dimensional transport to a small set of scalar ODEs, revealing how finite sample effects and architectural biases shape the generative process and highlighting when memorization occurs. Overall, it offers precise guidance on the interplay between learning from limited data and the quality of the resulting density sampling, including insights on when skip connections are beneficial.
Abstract
We study the problem of training a flow-based generative model, parametrized by a two-layer autoencoder, to sample from a high-dimensional Gaussian mixture. We provide a sharp end-to-end analysis of the problem. First, we provide a tight closed-form characterization of the learnt velocity field, when parametrized by a shallow denoising auto-encoder trained on a finite number $n$ of samples from the target distribution. Building on this analysis, we provide a sharp description of the corresponding generative flow, which pushes the base Gaussian density forward to an approximation of the target density. In particular, we provide closed-form formulae for the distance between the mean of the generated mixture and the mean of the target mixture, which we show decays as $Θ_n(\frac{1}{n})$. Finally, this rate is shown to be in fact Bayes-optimal.
