Table of Contents
Fetching ...

Hilbert space methods for approximating multi-output latent variable Gaussian processes

Soham Mukherjee, Manfred Claassen, Paul-Christian Bürkner

TL;DR

This work tackles GP scalability for high-dimensional, multi-output, and latent-input settings by extending Hilbert space Gaussian processes (HSGPs) to model cross-output correlations and latent inputs through spectral-density basis expansions. The approach yields linear scalability in the number of data points and basis functions, enabling Bayesian inference via MCMC or variational methods while maintaining strong uncertainty calibration. Through extensive simulations and a real scRNA pseudotime case study, the authors show that HSGPs offer superior latent-variable estimation accuracy and calibration compared to exact GPs and inducing-point VI methods, with practical speed benefits over exact inference. The method provides a scalable, calibration-friendly alternative for complex GP scenarios in genomics and related domains, though it includes considerations about priors, output-correlation modeling, and potential extensions to derivative information.

Abstract

Gaussian processes are a powerful class of non-linear models, but have limited applicability for larger datasets due to their high computational complexity. In such cases, approximate methods are required, for example, the recently developed class of Hilbert space Gaussian processes. They have been shown to significantly reduce computation time while retaining most of the favorable properties of exact Gaussian processes. However, Hilbert space approximations have so far only been developed for uni-dimensional outputs and manifest (known) inputs. Thus, we generalize Hilbert space methods to multi-output and latent input settings. Through extensive simulations, we show that the developed approximate Gaussian processes are indeed not only faster, but also provide similar or even better uncertainty calibration and accuracy of latent variable estimates compared to exact Gaussian processes. While not necessarily faster than alternative Gaussian process approximations, our new models provide better calibration and estimation accuracy, thus striking an excellent balance between trustworthiness and speed. We additionally illustrate our methods on a real-world case study from single cell biology.

Hilbert space methods for approximating multi-output latent variable Gaussian processes

TL;DR

This work tackles GP scalability for high-dimensional, multi-output, and latent-input settings by extending Hilbert space Gaussian processes (HSGPs) to model cross-output correlations and latent inputs through spectral-density basis expansions. The approach yields linear scalability in the number of data points and basis functions, enabling Bayesian inference via MCMC or variational methods while maintaining strong uncertainty calibration. Through extensive simulations and a real scRNA pseudotime case study, the authors show that HSGPs offer superior latent-variable estimation accuracy and calibration compared to exact GPs and inducing-point VI methods, with practical speed benefits over exact inference. The method provides a scalable, calibration-friendly alternative for complex GP scenarios in genomics and related domains, though it includes considerations about priors, output-correlation modeling, and potential extensions to derivative information.

Abstract

Gaussian processes are a powerful class of non-linear models, but have limited applicability for larger datasets due to their high computational complexity. In such cases, approximate methods are required, for example, the recently developed class of Hilbert space Gaussian processes. They have been shown to significantly reduce computation time while retaining most of the favorable properties of exact Gaussian processes. However, Hilbert space approximations have so far only been developed for uni-dimensional outputs and manifest (known) inputs. Thus, we generalize Hilbert space methods to multi-output and latent input settings. Through extensive simulations, we show that the developed approximate Gaussian processes are indeed not only faster, but also provide similar or even better uncertainty calibration and accuracy of latent variable estimates compared to exact Gaussian processes. While not necessarily faster than alternative Gaussian process approximations, our new models provide better calibration and estimation accuracy, thus striking an excellent balance between trustworthiness and speed. We additionally illustrate our methods on a real-world case study from single cell biology.

Paper Structure

This paper contains 21 sections, 30 equations, 34 figures, 2 tables.

Figures (34)

  • Figure 1: Squared exponential scenario: Convergence check for (a) latent inputs and (b) GP hyperparameters of the exact GPs (for $N = 20$) and HSGPs (combined for $N = 20, 50 \text{ } \text{and} \text{ } 200$ cases). The y-axes for Bulk and Tail ESS plots are log10 transformed.
  • Figure 2: Squared exponential scenario: ECDF-difference calibration plots of the latent $x$ estimated by exact GP, HSGP, and VIGP. Only the HSGP is consistently well calibrated.
  • Figure 3: Squared exponential scenario: log $\gamma$ scores offset by the 95% confidence threshold for all the fitted models. The behavior of scores across true latent $x$ values are shown in the right-hand panel. The blue dashed line denotes the threshold to reject uniformity, that is, models with values less than $0$ are miscalibrated. The HSGP($M$) shows the HSGPs with their corresponding number of basis functions.
  • Figure 4: Squared exponential scenario (highly varying $\rho$): log $\gamma$ scores offset by the 95% confidence threshold for all the fitted models. The behavior of scores across true latent $x$ values are shown in the right-hand panel. The blue dashed line denotes the threshold to reject uniformity , that is, models with values less than $0$ are miscalibrated. The HSGP($M$) shows the HSGPs with their corresponding number of basis functions.
  • Figure 5: Squared exponential scenario: posterior bias and SD on recovery of latent inputs for all fitted models. The HSGP($M$) shows the HSGPs with their corresponding number of basis functions.
  • ...and 29 more figures