Exploiting Inter-Sample Information for Long-tailed Out-of-Distribution Detection
Nimeshika Udayangani, Hadi M. Dolatabadi, Sarah Erfani, Christopher Leckie
TL;DR
This work tackles OOD detection under long-tailed ID distributions by building a graph that encodes inter-sample relationships using initial embeddings from a $k$-NN graph. It enhances these representations through Gaussianization of activations from a pre-trained backbone and refinement via a graph convolutional network, producing a discriminative feature space for LT-OOD detection. Evaluations on CIFAR10-LT, CIFAR100-LT, and ImageNet-LT show state-of-the-art improvements in OOD detection metrics (AUROC, AUPR, FPR95) and notably better tail-class ID accuracy, demonstrating robustness to distribution shifts. The approach offers practical impact for safe deployment in real-world LT settings and illustrates the value of combining pre-training, normalization alignment, and relational graph learning for OOD tasks.
Abstract
Detecting out-of-distribution (OOD) data is essential for safe deployment of deep neural networks (DNNs). This problem becomes particularly challenging in the presence of long-tailed in-distribution (ID) datasets, often leading to high false positive rates (FPR) and low tail-class ID classification accuracy. In this paper, we demonstrate that exploiting inter-sample relationships using a graph-based representation can significantly improve OOD detection in long-tailed recognition of vision datasets. To this end, we use the feature space of a pre-trained model to initialize our graph structure. We account for the differences between the activation layer distribution of the pre-training vs. training data, and actively introduce Gaussianization to alleviate any deviations from a standard normal distribution in the activation layers of the pre-trained model. We then refine this initial graph representation using graph convolutional networks (GCNs) to arrive at a feature space suitable for long-tailed OOD detection. This leads us to address the inferior performance observed in ID tail-classes within existing OOD detection methods. Experiments over three benchmarks CIFAR10-LT, CIFAR100-LT, and ImageNet-LT demonstrate that our method outperforms the state-of-the-art approaches by a large margin in terms of FPR and tail-class ID classification accuracy.
