Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers
Kevin Raina, Tanya Schmah
TL;DR
The paper tackles OOD detection under limited training data by leveraging Bayesian Neural Networks to quantify model uncertainty and by designing post-hoc OOD scores that operate in logit space. It introduces a new logit-space k-NN score, plus a class-conditioned variant, and demonstrates that Bayesian adaptations of deterministic scores outperform their non-Bayesian counterparts on MNIST and CIFAR-10 with small data. The results show that logit-based Bayesian scores, particularly the EL kNN+ variant, provide robust OOD discrimination, while prior expressivity influences the relative performance of predictive entropy versus mutual information. Overall, the work highlights the practical value of uncertainty-aware, prior-informed models for reliable OOD detection in low-data regimes and points to logit-space representations as a rich signal for uncertainty.
Abstract
Out-of-Distribution (OOD) detection is critical to AI reliability and safety, yet in many practical settings, only a limited amount of training data is available. Bayesian Neural Networks (BNNs) are a promising class of model on which to base OOD detection, because they explicitly represent epistemic (i.e. model) uncertainty. In the small training data regime, BNNs are especially valuable because they can incorporate prior model information. We introduce a new family of Bayesian posthoc OOD scores based on expected logit vectors, and compare 5 Bayesian and 4 deterministic posthoc OOD scores. Experiments on MNIST and CIFAR-10 In-Distributions, with 5000 training samples or less, show that the Bayesian methods outperform corresponding deterministic methods.
