Mining In-distribution Attributes in Outliers for Out-of-distribution Detection
Yutian Lei, Luping Ji, Pei Liu
TL;DR
This paper addresses unreliable predictions in out-of-distribution (OOD) detection by revealing that OOD samples often contain in-distribution (ID) attributes. It introduces the Extended Multi-view Data Model (MVDM) and a MaxLogit-based OOD score to interpret ID attributes in outliers, along with MVOL, a multi-view learning objective that calibrates logits using auxiliary OOD data while respecting ID feature structure. Theoretical results show favorable bounds for both single-model and ensemble-distillation regimes, and empirical results demonstrate that MVOL outperforms strong baselines on CIFAR benchmarks and wild datasets, while preserving ID accuracy. This approach offers robust, calibrated OOD detection applicable even when auxiliary data are noisy or scarce, with practical implications for deploying reliable systems.
Abstract
Out-of-distribution (OOD) detection is indispensable for deploying reliable machine learning systems in real-world scenarios. Recent works, using auxiliary outliers in training, have shown good potential. However, they seldom concern the intrinsic correlations between in-distribution (ID) and OOD data. In this work, we discover an obvious correlation that OOD data usually possesses significant ID attributes. These attributes should be factored into the training process, rather than blindly suppressed as in previous approaches. Based on this insight, we propose a structured multi-view-based out-of-distribution detection learning (MVOL) framework, which facilitates rational handling of the intrinsic in-distribution attributes in outliers. We provide theoretical insights on the effectiveness of MVOL for OOD detection. Extensive experiments demonstrate the superiority of our framework to others. MVOL effectively utilizes both auxiliary OOD datasets and even wild datasets with noisy in-distribution data. Code is available at https://github.com/UESTC-nnLab/MVOL.
