Self-Supervised Learning with Gaussian Processes
Yunshan Duan, Sinead Williamson
TL;DR
GPSSL introduces a Gaussian process prior on representations to enforce smoothness without relying on positive/negative pairs, enabling uncertainty-aware self-supervised learning. It formulates a generalized Bayesian posterior via generalized variational inference, adopts VICReg-style variance and covariance losses (without an explicit invariance term), and links GPSSL to kernel PCA and VICReg. Empirical results show GPSSL achieves competitive or superior performance on tabular and real-world data while providing meaningful uncertainty quantification in downstream tasks and in out-of-sample regions. The framework is particularly suited for structured data such as tabular, graphs, and spatial transcriptomics, where uncertainty in representations can be propagated to predictions and decision making.
Abstract
Self supervised learning (SSL) is a machine learning paradigm where models learn to understand the underlying structure of data without explicit supervision from labeled samples. The acquired representations from SSL have demonstrated useful for many downstream tasks including clustering, and linear classification, etc. To ensure smoothness of the representation space, most SSL methods rely on the ability to generate pairs of observations that are similar to a given instance. However, generating these pairs may be challenging for many types of data. Moreover, these methods lack consideration of uncertainty quantification and can perform poorly in out-of-sample prediction settings. To address these limitations, we propose Gaussian process self supervised learning (GPSSL), a novel approach that utilizes Gaussian processes (GP) models on representation learning. GP priors are imposed on the representations, and we obtain a generalized Bayesian posterior minimizing a loss function that encourages informative representations. The covariance function inherent in GPs naturally pulls representations of similar units together, serving as an alternative to using explicitly defined positive samples. We show that GPSSL is closely related to both kernel PCA and VICReg, a popular neural network-based SSL method, but unlike both allows for posterior uncertainties that can be propagated to downstream tasks. Experiments on various datasets, considering classification and regression tasks, demonstrate that GPSSL outperforms traditional methods in terms of accuracy, uncertainty quantification, and error control.
