RankFeat&RankWeight: Rank-1 Feature/Weight Removal for Out-of-distribution Detection
Yue Song, Wei Wang, Nicu Sebe
TL;DR
This work presents RankFeat and RankWeight as simple post hoc OOD detection techniques that remove rank-1 subspaces from high-level feature maps and from the last-layer weight matrices, respectively. RankFeat perturbs OOD predictions by subtracting the dominant singular component, yielding strong separation between ID and OOD data, while RankWeight provides a cheap, single-layer weight perturbation that can also boost other OOD detectors when used as a plugin. The authors provide extensive theoretical analyses, including upper-bound reductions and connections to random matrix theory and ReAct, along with comprehensive experiments across CNNs and vision transformers on ImageNet-1k and a large-scale Species dataset, achieving state-of-the-art FPR95 and AUROC. The method is broadly compatible with existing OOD approaches and offers practical benefits, including low computational overhead and easy integration, with public code available.
Abstract
The task of out-of-distribution (OOD) detection is crucial for deploying machine learning models in real-world settings. In this paper, we observe that the singular value distributions of the in-distribution (ID) and OOD features are quite different: the OOD feature matrix tends to have a larger dominant singular value than the ID feature, and the class predictions of OOD samples are largely determined by it. This observation motivates us to propose \texttt{RankFeat}, a simple yet effective \emph{post hoc} approach for OOD detection by removing the rank-1 matrix composed of the largest singular value and the associated singular vectors from the high-level feature. \texttt{RankFeat} achieves \emph{state-of-the-art} performance and reduces the average false positive rate (FPR95) by 17.90\% compared with the previous best method. The success of \texttt{RankFeat} motivates us to investigate whether a similar phenomenon would exist in the parameter matrices of neural networks. We thus propose \texttt{RankWeight} which removes the rank-1 weight from the parameter matrices of a single deep layer. Our \texttt{RankWeight}is also \emph{post hoc} and only requires computing the rank-1 matrix once. As a standalone approach, \texttt{RankWeight} has very competitive performance against other methods across various backbones. Moreover, \texttt{RankWeight} enjoys flexible compatibility with a wide range of OOD detection methods. The combination of \texttt{RankWeight} and \texttt{RankFeat} refreshes the new \emph{state-of-the-art} performance, achieving the FPR95 as low as 16.13\% on the ImageNet-1k benchmark. Extensive ablation studies and comprehensive theoretical analyses are presented to support the empirical results. Code is publicly available via \url{https://github.com/KingJamesSong/RankFeat}.
