Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

Kevin Raina; Tanya Schmah

Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

Kevin Raina, Tanya Schmah

TL;DR

The paper tackles OOD detection under limited training data by leveraging Bayesian Neural Networks to quantify model uncertainty and by designing post-hoc OOD scores that operate in logit space. It introduces a new logit-space k-NN score, plus a class-conditioned variant, and demonstrates that Bayesian adaptations of deterministic scores outperform their non-Bayesian counterparts on MNIST and CIFAR-10 with small data. The results show that logit-based Bayesian scores, particularly the EL kNN+ variant, provide robust OOD discrimination, while prior expressivity influences the relative performance of predictive entropy versus mutual information. Overall, the work highlights the practical value of uncertainty-aware, prior-informed models for reliable OOD detection in low-data regimes and points to logit-space representations as a rich signal for uncertainty.

Abstract

Out-of-Distribution (OOD) detection is critical to AI reliability and safety, yet in many practical settings, only a limited amount of training data is available. Bayesian Neural Networks (BNNs) are a promising class of model on which to base OOD detection, because they explicitly represent epistemic (i.e. model) uncertainty. In the small training data regime, BNNs are especially valuable because they can incorporate prior model information. We introduce a new family of Bayesian posthoc OOD scores based on expected logit vectors, and compare 5 Bayesian and 4 deterministic posthoc OOD scores. Experiments on MNIST and CIFAR-10 In-Distributions, with 5000 training samples or less, show that the Bayesian methods outperform corresponding deterministic methods.

Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

TL;DR

Abstract

Out-of-Distribution Detection from Small Training Sets using Bayesian Neural Network Classifiers

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)