Quantifying Statistical Significance of Deep Nearest Neighbor Anomaly Detection via Selective Inference
Mizuki Niihori, Shuichi Nishino, Teruyuki Katsuoka, Tomohiro Shiraishi, Kouichi Taji, Ichiro Takeuchi
TL;DR
This work tackles uncertainty quantification for deep kNN-based anomaly detection in a semi-supervised setting by casting anomaly signaling as a statistical test and applying Selective Inference to obtain valid selective p-values. The method accounts for selection bias from both kNN neighborhood choice and deep feature computations, reducing the problem to a tractable truncated chi-square computation with conditioning. Empirical results on synthetic, tabular, and image data (including MVTec AD) show controlled false positive rates at α = 0.05 and superior power compared to baselines, with an open-source implementation released for reproducibility. The approach provides a practical framework for reliable anomaly detection in industrial and safety-critical contexts by delivering principled significance measures alongside detection scores.
Abstract
In real-world applications, anomaly detection (AD) often operates without access to anomalous data, necessitating semi-supervised methods that rely solely on normal data. Among these methods, deep k-nearest neighbor (deep kNN) AD stands out for its interpretability and flexibility, leveraging distance-based scoring in deep latent spaces.Despite its strong performance, deep kNN lacks a mechanism to quantify uncertainty-an essential feature for critical applications such as industrial inspection. To address this limitation, we propose a statistical framework that quantifies the significance of detected anomalies in the form of p-values, thereby enabling control over false positive rates at a user-specified significance level (e.g.,0.05). A central challenge lies in managing selection bias, which we tackle using Selective Inference-a principled method for conducting inference conditioned on data-driven selections. We evaluate our method on diverse datasets and demonstrate that it provides reliable AD well-suited for industrial use cases.
