Towards One Model for Classical Dimensionality Reduction: A Probabilistic Perspective on UMAP and t-SNE
Aditya Ravuri, Neil D. Lawrence
TL;DR
The paper addresses the lack of a unified probabilistic understanding of popular dimensionality reduction methods like UMAP and t-SNE. It proposes recasting these algorithms as maximum-a-posteriori in a Wishart model of the graph Laplacian, where latent coordinates define a non-linear kernel-based covariance, linking ProbDR with Gaussian process latent variable models. Key contributions include a simplified ProbDR framework, a concrete distributional interpretation for UMAP/t-SNE via a non-linear kernel, and demonstrated connections to Laplacian Eigenmaps and GPLVMs. This work provides theoretical grounding for DR methods, enables principled incorporation of prior information, and offers a path toward kernel-informed, scalable embeddings.
Abstract
This paper shows that dimensionality reduction methods such as UMAP and t-SNE, can be approximately recast as MAP inference methods corresponding to a model introduced in Ravuri et al. (2023), that describes the graph Laplacian (an estimate of the data precision matrix) using a Wishart distribution, with a mean given by a non-linear covariance function evaluated on the latents. This interpretation offers deeper theoretical and semantic insights into such algorithms, and forging a connection to Gaussian process latent variable models by showing that well-known kernels can be used to describe covariances implied by graph Laplacians. We also introduce tools with which similar dimensionality reduction methods can be studied.
