Affine Invariant Ensemble Transform Methods to Improve Predictive Uncertainty in Neural Networks
Diksha Bhandari, Jakiw Pidstrigach, Sebastian Reich
TL;DR
The paper develops two affine-invariant, ensemble Kalman filter–based interacting particle systems to perform Bayesian inference for logistic regression and to quantify predictive uncertainty in neural networks. By deriving mean-field limits and proving quantitative convergence rates, the authors provide a rigorous foundation for using these IPSs as scalable, derivative-free Bayesian samplers. Empirical results on binary and multiclass classification, including out-of-distribution data and CIFAR-10, show that last-layer Bayesian approximations yield well-calibrated uncertainty without sacrificing accuracy, with the deterministic second-order sampler offering faster convergence than traditional HMC. The work advances uncertainty quantification in neural networks by combining affine-invariant transforms with mean-field analysis, enabling robust predictions under distributional shifts.
Abstract
We consider the problem of performing Bayesian inference for logistic regression using appropriate extensions of the ensemble Kalman filter. Two interacting particle systems are proposed that sample from an approximate posterior and prove quantitative convergence rates of these interacting particle systems to their mean-field limit as the number of particles tends to infinity. Furthermore, we apply these techniques and examine their effectiveness as methods of Bayesian approximation for quantifying predictive uncertainty in neural networks.
