Latent Space Non-Linear Statistics
Line Kuhnel, Tom Fletcher, Sarang Joshi, Stefan Sommer
TL;DR
The paper addresses the challenge that standard Euclidean statistics do not apply to latent spaces learned by deep generative models, where the latent manifold inherits a nonlinear Riemannian geometry from the data space. It leverages nonlinear manifold statistics—Fréchet/ML means, principal geodesic analysis (PGA), and diffusion-based inference—to perform meaningful statistics directly in the latent space, while introducing a neural network to rapidly approximate the latent metric g and its cometric g^{-1}. Key contributions include coupling nonlinear statistics with pre-trained latent representations, a fast metric-approximation network that dramatically speeds up geometric computations, and diffusion-bridge–based maximum likelihood inference for latent means, demonstrated on synthetic data and MNIST within the Theano Geometry framework. The work provides geometry-aware, sample-efficient tools for analyzing data on learned manifolds, with potential impact in domains such as medical imaging where labeled data are scarce but unlabeled data are abundant.
Abstract
Given data, deep generative models, such as variational autoencoders (VAE) and generative adversarial networks (GAN), train a lower dimensional latent representation of the data space. The linear Euclidean geometry of data space pulls back to a nonlinear Riemannian geometry on the latent space. The latent space thus provides a low-dimensional nonlinear representation of data and classical linear statistical techniques are no longer applicable. In this paper we show how statistics of data in their latent space representation can be performed using techniques from the field of nonlinear manifold statistics. Nonlinear manifold statistics provide generalizations of Euclidean statistical notions including means, principal component analysis, and maximum likelihood fits of parametric probability distributions. We develop new techniques for maximum likelihood inference in latent space, and adress the computational complexity of using geometric algorithms with high-dimensional data by training a separate neural network to approximate the Riemannian metric and cometric tensor capturing the shape of the learned data manifold.
