Variational Learning of Gaussian Process Latent Variable Models through Stochastic Gradient Annealed Importance Sampling
Jian Xu, Shian Du, Junmei Yang, Qianli Ma, Delu Zeng
TL;DR
The paper tackles the problem of loose variational bounds and weight collapse in high-dimensional GPLVMs by introducing VAIS-GPLVM, a variational approach that leverages Annealed Importance Sampling with time-inhomogeneous Unadjusted Langevin Diffusion to transform the posterior into a sequence of bridging distributions. It derives a tractable AIS-based ELBO via reparameterization, enabling stochastic gradient optimization, and demonstrates tighter bounds, higher log-likelihoods, and more robust convergence on toy and image datasets compared to MF and IW baselines. An extensive set of experiments shows improved reconstruction performance, uncertainty handling for unseen data with missing values, and significantly better ESS/weight-entropy metrics, indicating reduced weight degeneracy. The work offers a scalable, principled framework for variational learning in GPLVMs, with potential impact on dimensionality reduction and missing data recovery in complex, high-dimensional settings.
Abstract
Gaussian Process Latent Variable Models (GPLVMs) have become increasingly popular for unsupervised tasks such as dimensionality reduction and missing data recovery due to their flexibility and non-linear nature. An importance-weighted version of the Bayesian GPLVMs has been proposed to obtain a tighter variational bound. However, this version of the approach is primarily limited to analyzing simple data structures, as the generation of an effective proposal distribution can become quite challenging in high-dimensional spaces or with complex data sets. In this work, we propose an Annealed Importance Sampling (AIS) approach to address these issues. By transforming the posterior into a sequence of intermediate distributions using annealing, we combine the strengths of Sequential Monte Carlo samplers and VI to explore a wider range of posterior distributions and gradually approach the target distribution. We further propose an efficient algorithm by reparameterizing all variables in the evidence lower bound (ELBO). Experimental results on both toy and image datasets demonstrate that our method outperforms state-of-the-art methods in terms of tighter variational bounds, higher log-likelihoods, and more robust convergence.
