Generative Latent Flow
Zhisheng Xiao, Qing Yan, Yali Amit
TL;DR
GLF addresses the challenge of producing high-quality samples with auto-encoder-based generative models by placing a normalizing flow on the latent space to map to Gaussian noise, enabling end-to-end single-stage training. It combines a deterministic encoder–decoder with a bijective latent-space transform, computed via affine coupling blocks, and trains with a joint reconstruction and NLL objective while employing a stop-gradient technique to maintain stable latent representations. Empirically, GLF achieves state-of-the-art AE-based sample quality across MNIST, Fashion-MNIST, CIFAR-10, and CelebA, with faster convergence than competing methods and competitive results relative to GANs. The work clarifies connections to VAEs with flow priors and highlights the practical benefits of the stop-gradient design for reliable, efficient density matching in latent space.
Abstract
In this work, we propose the Generative Latent Flow (GLF), an algorithm for generative modeling of the data distribution. GLF uses an Auto-encoder (AE) to learn latent representations of the data, and a normalizing flow to map the distribution of the latent variables to that of simple i.i.d noise. In contrast to some other Auto-encoder based generative models, which use various regularizers that encourage the encoded latent distribution to match the prior distribution, our model explicitly constructs a mapping between these two distributions, leading to better density matching while avoiding over regularizing the latent variables. We compare our model with several related techniques, and show that it has many relative advantages including fast convergence, single stage training and minimal reconstruction trade-off. We also study the relationship between our model and its stochastic counterpart, and show that our model can be viewed as a vanishing noise limit of VAEs with flow prior. Quantitatively, under standardized evaluations, our method achieves state-of-the-art sample quality among AE based models on commonly used datasets, and is competitive with GANs' benchmarks.
