Sylvester Normalizing Flows for Variational Inference
Rianne van den Berg, Leonard Hasenclever, Jakub M. Tomczak, Max Welling
TL;DR
The paper addresses the limitation of simple variational posteriors in VI by introducing Sylvester normalizing flows (SNFs), a generalization of planar flows that uses Sylvester's determinant identity to maintain a tractable Jacobian while removing bottlenecks. SNFs are parameterized to allow flexible, data-dependent transformations, including three orthogonality-preserving variants: Orthogonal, Householder, and Triangular Sylvester flows, with a hypernetwork enabling data-conditioned flow parameters. Empirical results on MNIST, FreyFaces, Omniglot, and Caltech 101 Silhouettes show SNFs often outperform planar flows and IAF, particularly on larger or more complex datasets, while some datasets like FreyFaces favor planar flows. The approach offers a scalable, expressive, and data-adaptive way to enrich variational posteriors, potentially improving ELBO tightness and generative performance in VAEs and related models.
Abstract
Variational inference relies on flexible approximate posterior distributions. Normalizing flows provide a general recipe to construct flexible variational posteriors. We introduce Sylvester normalizing flows, which can be seen as a generalization of planar flows. Sylvester normalizing flows remove the well-known single-unit bottleneck from planar flows, making a single transformation much more flexible. We compare the performance of Sylvester normalizing flows against planar flows and inverse autoregressive flows and demonstrate that they compare favorably on several datasets.
