Fixed-Mean Gaussian Processes for Post-hoc Bayesian Deep Learning
Luis A. Ortega, Simón Rodríguez-Santana, Daniel Hernández-Lobato
TL;DR
The paper tackles post-hoc uncertainty estimation for pre-trained deep neural networks by fixing the Gaussian Process mean to the DNN output using a universal kernel. It introduces Fixed-Mean Gaussian Processes (FMGP), a decoupled sparse GP variational framework that fixes the mean while learning variances, avoiding DNN Jacobians and enabling scalable uncertainty estimates on large datasets. Across synthetic problems, CIFAR10, ImageNet, and QM9, FMGP achieves robust uncertainty calibration and competitive or superior predictive performance with favorable training and inference times compared to state-of-the-art post-hoc methods. The approach is architecture-agnostic and scalable, offering a practical pathway to robust post-hoc Bayesian deep learning with broad applicability and potential for kernel customization.
Abstract
Recently, there has been an increasing interest in performing post-hoc uncertainty estimation about the predictions of pre-trained deep neural networks (DNNs). Given a pre-trained DNN via back-propagation, these methods enhance the original network by adding output confidence measures, such as error bars, without compromising its initial accuracy. In this context, we introduce a novel family of sparse variational Gaussian processes (GPs), where the posterior mean is fixed to any continuous function when using a universal kernel. Specifically, we fix the mean of this GP to the output of the pre-trained DNN, allowing our approach to effectively fit the GP's predictive variances to estimate the DNN prediction uncertainty. Our approach leverages variational inference (VI) for efficient stochastic optimization, with training costs that remain independent of the number of training points, scaling efficiently to large datasets such as ImageNet. The proposed method, called fixed mean GP (FMGP), is architecture-agnostic, relying solely on the pre-trained model's outputs to adjust the predictive variances. Experimental results demonstrate that FMGP improves both uncertainty estimation and computational efficiency when compared to state-of-the-art methods.
