On Probabilistic Pullback Metrics for Latent Hyperbolic Manifolds
Luis Augenstein, Noémie Jaquier, Tamim Asfour, Leonel Rozo
TL;DR
The paper tackles interpolation in hyperbolic latent spaces used to model hierarchical data, showing that standard hyperbolic geodesics can pass through low-data regions and increase uncertainty. It introduces a hyperbolic pullback metric induced by a stochastic immersion $f:\,\mathbb{H}^{D_x}_{\mathcal{L}} \to \mathbb{R}^{D_y}$, with the latent-space metric $\bm{G}^{\text{P}}_{\bm{x}}=\tilde{\bm{J}}^\top \bm{G}_{\bm{y}} \tilde{\bm{J}}$ and an expectation $\mathbb{E}[\bm{G}^{\text{P}}_{\bm{x}}] = \mathbb{E}[\bm{J}]^\top \mathbb{E}[\bm{J}] + D_y \bm{\Sigma}_J$, enabling geometry-aware distances. It then specializes to the gphlvm by deriving the Jacobian distribution under GP conditioning, yielding $\mathbb{E}[\bm{G}^{\text{P},\mathcal{L}}_{\bm{x}^*}] = \bm{P}_{\bm{x}^*}(\boldsymbol{\mu}_J^\top \boldsymbol{\mu}_J + D_y \boldsymbol{\Sigma}_J) \bm{P}_{\bm{x}^*}^\top$, where the projection $\bm{P}_{\bm{x}^*}$ enforces hyperbolic tangent-space constraints. Hyperbolic pullback geodesics are computed by minimizing the curve energy with the pullback metric, including a spline regularization, using a Riemannian optimizer. The work also provides analytic derivatives for 2D and 3D hyperbolic SE kernels to enable efficient gradient-based optimization, and demonstrates through four experiments that pullback geodesics follow data support and reduce uncertainty compared with baseline geodesics, highlighting practical gains in manifold-aware latent distances and generative interpolations.
Abstract
Probabilistic Latent Variable Models (LVMs) excel at modeling complex, high-dimensional data through lower-dimensional representations. Recent advances show that equipping these latent representations with a Riemannian metric unlocks geometry-aware distances and shortest paths that comply with the underlying data structure. This paper focuses on hyperbolic embeddings, a particularly suitable choice for modeling hierarchical relationships. Previous approaches relying on hyperbolic geodesics for interpolating the latent space often generate paths crossing low-data regions, leading to highly uncertain predictions. Instead, we propose augmenting the hyperbolic manifold with a pullback metric to account for distortions introduced by the LVM's nonlinear mapping and provide a complete development for pullback metrics of Gaussian Process LVMs (GPLVMs). Our experiments demonstrate that geodesics on the pullback metric not only respect the geometry of the hyperbolic latent space but also align with the underlying data distribution, significantly reducing uncertainty in predictions.
