BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling

Lars Maaløe; Marco Fraccaro; Valentin Liévin; Ole Winther

TL;DR

BIVA is a very deep hierarchical latent-variable model that uses a deterministic top-down pathway and a bidirectional (bottom-up and top-down) inference network to enable rich posterior covariances and robust information flow. It achieves state-of-the-art likelihoods on several benchmarks, produces sharp natural-image samples, and supports anomaly detection and semi-supervised classification. Extensive ablations and experiments show that deep latent hierarchies with skip connections can match or exceed other non-autoregressive methods and close the gap with autoregressive and flow-based models. These results highlight the practical value of structured latent representations for high-quality generation and reliable anomaly detection in complex data distributions.

Abstract

With the introduction of the variational autoencoder (VAE), probabilistic latent variable models have received renewed attention as powerful generative models. However, their performance in terms of test likelihood and quality of generated samples has been surpassed by autoregressive models without stochastic units. Furthermore, flow-based models have recently been shown to be an attractive alternative that scales well to high-dimensional data. In this paper we close the performance gap by constructing VAE models that can effectively utilize a deep hierarchy of stochastic variables and model complex covariance structures. We introduce the Bidirectional-Inference Variational Autoencoder (BIVA), characterized by a skip-connected generative model and an inference network formed by a bidirectional stochastic inference path. We show that BIVA reaches state-of-the-art test likelihoods, generates sharp and coherent natural images, and uses the hierarchy of latent variables to capture different aspects of the data distribution. We observe that BIVA, in contrast to recent results, can be used for anomaly detection. We attribute this to the hierarchy of latent variables which is able to extract high-level semantic features. Finally, we extend BIVA to semi-supervised classification tasks and show that it performs comparably to state-of-the-art results by generative adversarial networks.
