Kernel PCA for Out-of-Distribution Detection: Non-Linear Kernel Selections and Approximations

Kun Fang; Qinghua Tao; Mingzhen He; Kexin Lv; Runze Yang; Haibo Hu; Xiaolin Huang; Jie Yang; Longbin Cao

TL;DR

This work reframes OoD detection as learning a discriminative non-linear subspace of InD features via Kernel PCA (KPCA). It introduces a Cosine-Gaussian kernel to capture two key non-linear patterns relating InD and OoD distributions and provides two scalable kernel-approximation schemes, Random Fourier Features and a data-dependent Nyström method, to compute reconstruction errors efficiently. An energy-based Nyström sampling strategy further enhances subspace learning by focusing on boundary regions between InD and OoD. Empirically, the proposed KPCA framework achieves state-of-the-art OoD detection performance on ImageNet-1K with ResNet-50 and ViT, while maintaining low inference cost and memory; the work also offers detailed analysis of kernel choices, sampling schemes, and hyper-parameter sensitivity. Overall, the approach provides a practical, kernel-design-oriented pathway to robust OoD detection in large-scale deep learning systems.

Abstract

Out-of-Distribution (OoD) detection is vital for the reliability of deep neural networks, and its key lies in effectively characterizing the disparities between OoD and In-Distribution (InD) data. In this work, such disparities are exploited from the fresh perspective of a non-linear feature subspace. That is, a discriminative non-linear subspace is learned from InD features to capture representative patterns of InD, while informative patterns of OoD features cannot be well captured in such a subspace because they follow a different distribution. Building on this perspective, we exploit the deviations of InD and OoD features in such a non-linear subspace for effective OoD detection. Specifically, we leverage the framework of Kernel Principal Component Analysis (KPCA) to obtain the discriminative non-linear subspace and use the reconstruction error on this subspace to distinguish InD from OoD data. Two challenges emerge: (i) learning an effective non-linear subspace, i.e., selecting the kernel function in KPCA, and (ii) computing the kernel matrix on large-scale InD data. For the former, we reveal two vital non-linear patterns that closely relate to the InD-OoD disparity, leading to a Cosine-Gaussian kernel for constructing the subspace. For the latter, we introduce two techniques that approximate the Cosine-Gaussian kernel at significantly lower computational cost. In particular, our approximation is further tailored by incorporating the InD data confidence, which is demonstrated to promote the learning of discriminative subspaces for OoD data. Our study presents new insights into the non-linear feature subspace for OoD detection and contributes practical explorations of the associated kernel design and efficient computation, yielding a KPCA detection method with distinctly improved efficacy and efficiency.
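To make the reconstruction-error idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes one plausible reading of the Cosine-Gaussian kernel, namely l2-normalizing the features and then applying a Gaussian kernel, which is approximated here with Random Fourier Features. The function names, the synthetic data, the mean-centering step, and all hyper-parameter values (`gamma`, the number of random features, the subspace dimension `q`) are illustrative assumptions.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    # Cosine part (assumed): project every feature vector onto the unit sphere.
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def make_rff(dim_in, dim_out, gamma, rng):
    # Random Fourier Features for k(x, y) = exp(-gamma * ||x - y||^2):
    # frequencies are drawn from N(0, 2 * gamma * I), phases from U[0, 2*pi].
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(dim_in, dim_out))
    b = rng.uniform(0.0, 2.0 * np.pi, size=dim_out)
    return W, b

def rff_map(x, W, b):
    # Explicit feature map whose inner products approximate the Gaussian kernel.
    return np.sqrt(2.0 / W.shape[1]) * np.cos(x @ W + b)

def fit_subspace(z, q):
    # Linear PCA in the mapped space stands in for KPCA with the approximated kernel.
    mean = z.mean(axis=0)
    zc = z - mean
    cov = zc.T @ zc / len(z)
    _, eigvecs = np.linalg.eigh(cov)   # eigenvalues come back in ascending order
    return mean, eigvecs[:, -q:]       # keep the q leading principal directions

def reconstruction_error(z, mean, U):
    # Norm of the residual after projecting onto the learned subspace.
    zc = z - mean
    return np.linalg.norm(zc - zc @ U @ U.T, axis=1)

# Synthetic stand-ins for penultimate-layer features (hypothetical data).
rng = np.random.default_rng(0)
A = rng.normal(size=(2, 16))

def sample_ind(n):
    # InD features lie close to a 2-D subspace of the 16-D feature space.
    return rng.normal(size=(n, 2)) @ A + 0.02 * rng.normal(size=(n, 16))

ind_train, ind_test = sample_ind(2000), sample_ind(500)
ood_test = rng.normal(size=(500, 16))  # OoD features: isotropic, no shared structure

W, b = make_rff(dim_in=16, dim_out=256, gamma=1.0, rng=rng)

def embed(x):
    return rff_map(l2_normalize(x), W, b)

mean, U = fit_subspace(embed(ind_train), q=32)
ind_scores = reconstruction_error(embed(ind_test), mean, U)
ood_scores = reconstruction_error(embed(ood_test), mean, U)
print("mean InD score:", ind_scores.mean())
print("mean OoD score:", ood_scores.mean())
```

In this sketch a larger score means the sample is less well explained by the subspace learned from InD features, so a threshold on the score flags OoD inputs. Because the feature map is explicit, the cost depends on the number of random features rather than on the number of InD training samples, which is the motivation for approximating the kernel instead of forming the full kernel matrix.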
