Incorporating Unlabelled Data into Bayesian Neural Networks
Mrinank Sharma, Tom Rainforth, Yee Whye Teh, Vincent Fortuin
TL;DR
This work tackles the limitation that conventional Bayesian Neural Networks cannot utilize unlabelled data to improve predictions. It introduces Self-Supervised BNNs (SS-BNNs), which learn a function-space prior from unlabelled data by generating pseudo-labelled contrastive tasks and optimizing a variational bound on the marginal likelihood. The learned priors yield prior predictives that better capture semantic similarity, leading to improved predictive performance, particularly in low-label regimes, and enhanced robustness to out-of-distribution data and active-learning scenarios. The approach provides a principled Bayesian interpretation of contrastive learning and demonstrates that unlabelled data can meaningfully inform function priors, with practical impact for label-efficient learning and uncertainty quantification.
Abstract
Conventional Bayesian Neural Networks (BNNs) are unable to leverage unlabelled data to improve their predictions. To overcome this limitation, we introduce Self-Supervised Bayesian Neural Networks, which use unlabelled data to learn models with suitable prior predictive distributions. This is achieved by leveraging contrastive pretraining techniques and optimising a variational lower bound. We then show that the prior predictive distributions of self-supervised BNNs capture problem semantics better than conventional BNN priors. In turn, our approach offers improved predictive performance over conventional BNNs, especially in low-budget regimes.
