Walking on the Fiber: A Simple Geometric Approximation for Bayesian Neural Networks
Alfredo Reichlin, Miguel Vasco, Danica Kragic
TL;DR
The paper tackles the challenge of intractable posteriors in Bayesian neural networks by introducing MetricBNN, which first explores the local MAP neighborhood via a lightweight drift-plus-gradient sampling scheme and then learns a structured latent posterior to enable rapid, non-iterative sampling. This two-stage approach captures non-Gaussian, low-dimensional loss-landscape geometry that traditional Laplace approximations miss, providing a scalable alternative to refinement-based posteriors. Across toy tasks, UCI datasets, and high-dimensional image and OOD benchmarks, MetricBNN achieves competitive negative log-likelihood and calibration (ECE) while reducing computational overhead compared to Hessian-based methods, HMC, and deep ensembles. The work demonstrates the practical potential of coupling geometry-aware sampling with autoencoder-based latent representations for robust Bayesian uncertainty quantification in deep learning.
Abstract
Bayesian Neural Networks provide a principled framework for uncertainty quantification by modeling the posterior distribution of network parameters. However, exact posterior inference is computationally intractable, and widely used approximations like the Laplace method struggle with scalability and posterior accuracy in modern deep networks. In this work, we revisit sampling techniques for posterior exploration, proposing a simple variation tailored to efficiently sample from the posterior in over-parameterized networks by leveraging the low-dimensional structure of loss minima. Building on this, we introduce a model that learns a deformation of the parameter space, enabling rapid posterior sampling without requiring iterative methods. Empirical results demonstrate that our approach achieves competitive posterior approximations with improved scalability compared to recent refinement techniques. These contributions provide a practical alternative for Bayesian inference in deep learning.
