Uncertainty-Aware Out-of-Distribution Detection with Gaussian Processes
Yang Chen, Chih-Li Sung, Arpan Kusari, Xiaoyang Song, Wenbo Sun
TL;DR
The paper tackles OOD detection for $K$-class classification by introducing a Gaussian-process-based detector that requires no OOD data during training. It emulates the DNN mapping from an intermediate representation to unconstrained Softmax scores with class-specific GPs, then uses a KL-divergence–based score to distinguish InD from OOD samples, with thresholds learned solely from InD data. Key contributions include (i) a practical InD-only OOD detector compatible with standard architectures, (ii) a probabilistic framework that provides predictive uncertainty for each class, and (iii) extensive experiments showing strong TNR and AUROC on both conventional and large-scale real-world datasets. The approach offers a principled, uncertainty-aware alternative to data-tuned or generative OOD methods, with potential impact on safety-critical deployments where OOD data is rare or unavailable during training.
Abstract
Deep neural networks (DNNs) are often constructed under the closed-world assumption, which may fail to generalize to the out-of-distribution (OOD) data. This leads to DNNs producing overconfident wrong predictions and can result in disastrous consequences in safety-critical applications. Existing OOD detection methods mainly rely on curating a set of OOD data for model training or hyper-parameter tuning to distinguish OOD data from training data (also known as in-distribution data or InD data). However, OOD samples are not always available during the training phase in real-world applications, hindering the OOD detection accuracy. To overcome this limitation, we propose a Gaussian-process-based OOD detection method to establish a decision boundary based on InD data only. The basic idea is to perform uncertainty quantification of the unconstrained softmax scores of a DNN via a multi-class Gaussian process (GP), and then define a score function to separate InD and potential OOD data based on their fundamental differences in the posterior predictive distribution from the GP. Two case studies on conventional image classification datasets and real-world image datasets are conducted to demonstrate that the proposed method outperforms the state-of-the-art OOD detection methods when OOD samples are not observed in the training phase.
