Prototype-based Optimal Transport for Out-of-Distribution Detection
Ao Ke, Wenlong Chen, Chuanwen Feng, Yukun Cao, Xike Xie, S. Kevin Zhou, Lei Feng
TL;DR
The paper addresses out-of-distribution (OOD) detection in deep networks by measuring the distribution discrepancy between test samples and in-distribution (ID) prototypes. It introduces Prototype-based Optimal Transport (POT), which solves an optimal transport (OT) problem between test representations and class prototypes to obtain per-sample transport costs, and augments these with linearly extrapolated virtual outliers to improve detection of near-ID OOD samples. The final OOD score contrasts the transport cost to ID prototypes with the transport cost to the virtual outliers, computed efficiently via entropic regularization and the Sinkhorn-Knopp algorithm. Empirically, POT achieves state-of-the-art performance on CIFAR-100 and ImageNet-1k across Far-OOD and Near-OOD benchmarks, is compatible with various training-time schemes, and remains effective even when training data are unavailable, which makes it practical for reliable deployment.
Abstract
Detecting Out-of-Distribution (OOD) inputs is crucial for improving the reliability of deep neural networks in real-world deployment. In this paper, inspired by the inherent distribution shift between in-distribution (ID) and OOD data, we propose a novel method that leverages optimal transport to measure the distribution discrepancy between test inputs and ID prototypes. The resulting transport costs quantify the individual contribution of each test input to the overall discrepancy, which makes them a natural measure for OOD detection. Because the transport costs to ID prototypes alone are inadequate for identifying OOD inputs that lie close to ID data, we generate virtual outliers via linear extrapolation to approximate the OOD region. Combining the transport costs to ID prototypes with the costs to virtual outliers emphasizes the detection of OOD data near ID data, thereby sharpening the distinction between ID and OOD inputs. Experiments demonstrate that our method outperforms state-of-the-art methods.
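
The pipeline described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the cosine-distance cost, the uniform marginals over test samples and prototypes, the extrapolation rule (each prototype pushed past the test-batch mean by a hypothetical factor `omega`), the use of a plain difference as the "contrast", and the convention that a higher score means more OOD-like. The function names are likewise hypothetical.

```python
import numpy as np


def cosine_cost(x, y):
    """Pairwise cosine distance (1 - cosine similarity) between rows of x and y."""
    x = x / np.linalg.norm(x, axis=1, keepdims=True)
    y = y / np.linalg.norm(y, axis=1, keepdims=True)
    return 1.0 - x @ y.T


def sinkhorn_plan(cost, a, b, eps=0.1, n_iters=200):
    """Entropic-regularized OT plan between marginals a and b (Sinkhorn-Knopp)."""
    kernel = np.exp(-cost / eps)
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (kernel.T @ u)  # rescale columns toward marginal b
        u = a / (kernel @ v)    # rescale rows toward marginal a
    return u[:, None] * kernel * v[None, :]


def per_sample_cost(feats, targets, eps=0.1):
    """Transport cost carried by each test sample (row-wise sum of plan * cost)."""
    n, m = len(feats), len(targets)
    cost = cosine_cost(feats, targets)
    # Assumption: uniform mass on every test sample and every target point.
    plan = sinkhorn_plan(cost, np.full(n, 1.0 / n), np.full(m, 1.0 / m), eps)
    return (plan * cost).sum(axis=1)


def pot_style_score(feats, prototypes, omega=1.5, eps=0.1):
    """Contrast of costs to ID prototypes vs. virtual outliers (higher = more OOD)."""
    # Assumption: extrapolate each prototype through the test-batch mean;
    # omega > 1 places the virtual outlier beyond that mean.
    test_mean = feats.mean(axis=0, keepdims=True)
    virtual_outliers = prototypes + omega * (test_mean - prototypes)
    cost_to_id = per_sample_cost(feats, prototypes, eps)
    cost_to_outliers = per_sample_cost(feats, virtual_outliers, eps)
    return cost_to_id - cost_to_outliers


# Toy 2-D example: two class prototypes, four ID-like and four OOD-like features.
prototypes = np.array([[1.0, 0.0], [0.0, 1.0]])
feats = np.array([
    [1.0, 0.0], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0],          # close to prototypes
    [-1.0, -1.0], [-1.0, -0.9], [-0.9, -1.0], [-1.0, -1.1],  # far from prototypes
])
scores = pot_style_score(feats, prototypes)
print(np.round(scores, 3))
```

In this toy setting the features far from both prototypes receive higher scores than those close to a prototype. In practice the prototypes would be computed from ID representations (for example, per-class feature means), and `eps`, `omega`, and the number of Sinkhorn iterations would need tuning.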
