CADRef: Robust Out-of-Distribution Detection via Class-Aware Decoupled Relative Feature Leveraging
Zhiwei Ling, Yachen Chang, Hailiang Zhao, Xinkui Zhao, Kingsum Chow, Shuiguang Deng
TL;DR
This work tackles the persistent issue of overconfidence by DNNs on out-of-distribution data. It introduces CARef, which uses class-aware average features to compute a relative feature error as an OOD score, and CADRef, which extends CARef with feature decoupling and error scaling that integrate logit-based signals for stronger discrimination. Across ImageNet-1k and CIFAR benchmarks and multiple architectures, CADRef (and CARef) achieve robust improvements in AUROC and FPR95 compared to state-of-the-art baselines, maintaining strong performance even when some logit-based scores falter. The approach highlights the value of combining feature-space cues with logits while carefully decomposing feature contributions, offering a practical, robust framework for post-hoc OOD detection with broad applicability.
Abstract
Deep neural networks (DNNs) have been widely criticized for their overconfidence when dealing with out-of-distribution (OOD) samples, highlighting the critical need for effective OOD detection to ensure the safe deployment of DNNs in real-world settings. Existing post-hoc OOD detection methods primarily enhance the discriminative power of logit-based approaches by reshaping sample features, yet they often neglect critical information inherent in the features themselves. In this paper, we propose the Class-Aware Relative Feature-based method (CARef), which utilizes the error between a sample's feature and its class-aware average feature as a discriminative criterion. To further refine this approach, we introduce the Class-Aware Decoupled Relative Feature-based method (CADRef), which decouples sample features based on the alignment of signs between the relative feature and corresponding model weights, enhancing the discriminative capabilities of CARef. Extensive experimental results across multiple datasets and models demonstrate that both proposed methods exhibit effectiveness and robustness in OOD detection compared to state-of-the-art methods. Specifically, our two methods outperform the best baseline by 2.82% and 3.27% in AUROC, with improvements of 4.03% and 6.32% in FPR95, respectively.
