Geodesic Calculus on Latent Spaces
Florine Hartwig, Josua Sassen, Juliane Braunsmann, Martin Rumpf, Benedikt Wirth
TL;DR
The paper tackles the challenge of performing meaningful geometric operations on latent spaces of autoencoders, which are typically implicit and lack explicit manifold structure. It proposes describing latent representations as implicit submanifolds $\mathcal{Z}=\{z: \zeta(z)=0\}$ with a learned projection $\Pi_\sigma$ (via a denoising objective) to obtain a robust implicit representation and allow Riemannian calculus. A time-discrete geodesic calculus is developed and implemented with an augmented Lagrangian approach to compute discrete geodesics and discrete exponential maps on $\mathcal{Z}$ under various metrics, enabling geodesic interpolation and extrapolation directly in latent space. The framework is validated across multiple data modalities, including discrete shells, motion capture with spherical VAEs, and image data, showing improved interpolation behavior and plausible latent-geodesic paths when decoded. This work thus enables practical, geometry-aware manipulation of latent representations, with potential to extend to distance-based and probabilistic latent models and to richer geometric constructions such as parallel transport and curvature.
Abstract
Latent manifolds of autoencoders provide low-dimensional representations of data, which can be studied from a geometric perspective. We propose to describe these latent manifolds as implicit submanifolds of some ambient latent space. Based on this, we develop tools for a discrete Riemannian calculus approximating classical geometric operators. These tools are robust against inaccuracies of the implicit representation often occurring in practical examples. To obtain a suitable implicit representation, we propose to learn an approximate projection onto the latent manifold by minimizing a denoising objective. This approach is independent of the underlying autoencoder and supports the use of different Riemannian geometries on the latent manifolds. The framework in particular enables the computation of geodesic paths connecting given end points and shooting geodesics via the Riemannian exponential maps on latent manifolds. We evaluate our approach on various autoencoders trained on synthetic and real data.
