Optimal Subspace Inference for the Laplace Approximation of Bayesian Neural Networks
Josua Faller, Jörg Martin
TL;DR
Addresses tractable uncertainty quantification for Bayesian neural networks by deriving an optimal affine subspace inference within the Laplace approximation and providing a practical construction workflow. The key theoretical result shows an optimal subspace that can recover the full epistemic covariance in a rank-s form, enabling reliable uncertainty estimates with far fewer parameters. Empirically, low-rank subspace constructions, especially using KFAC-based approximations, outperform subset-based methods across regression and classification, and a trace-based criterion effectively ranks subspaces. The work offers a scalable, principled path to subspace-based Bayes posteriors in large networks, with limitations tied to the quality of the posterior-covariance approximations and storage of projection matrices.
Abstract
Subspace inference for neural networks assumes that a subspace of their parameter space suffices to produce a reliable uncertainty quantification. In this work, we mathematically derive the optimal subspace model to a Bayesian inference scenario based on the Laplace approximation. We demonstrate empirically that, in the optimal case, often a fraction of parameters less than 1% is sufficient to obtain a reliable estimate of the full Laplace approximation. Since the optimal solution is derived, we can evaluate all other subspace models against a baseline. In addition, we give an approximation of our method that is applicable to larger problem settings, in which the optimal solution is not computable, and compare it to existing subspace models from the literature. In general, our approximation scheme outperforms previous work. Furthermore, we present a metric to qualitatively compare different subspace models even if the exact Laplace approximation is unknown.
