EigenScore: OOD Detection using Covariance in Diffusion Models
Shirin Shoushtari, Yi Wang, Xiao Shi, M. Salman Asif, Ulugbek S. Kamilov
TL;DR
Problem: OOD detection in diffusion-model pipelines is critical for safety, yet existing likelihood or score-based metrics can be unstable or misorder inputs. Approach: EigenScore uses the eigen-spectrum of the denoising posterior covariance, $Cov_p[{\bm{x}}|{\bm{x}}_t]$, to quantify distribution shift, estimated via a Jacobian-free subspace iteration on the forward denoiser. Contributions: a theoretical link between distribution shift and excess denoising error through the posterior covariance, a practical method to compute leading eigenvalues without explicit Jacobians, and strong empirical AUROC gains including in near-OOD settings. Significance: delivers a principled, scalable OOD detector for diffusion-based systems with real-world impact in safety-critical tasks.
Abstract
Out-of-distribution (OOD) detection is critical for the safe deployment of machine learning systems in safety-sensitive domains. Diffusion models have recently emerged as powerful generative models, capable of capturing complex data distributions through iterative denoising. Building on this progress, recent work has explored their potential for OOD detection. We propose EigenScore, a new OOD detection method that leverages the eigenvalue spectrum of the posterior covariance induced by a diffusion model. We argue that posterior covariance provides a consistent signal of distribution shift, leading to larger trace and leading eigenvalues on OOD inputs, yielding a clear spectral signature. We further provide analysis explicitly linking posterior covariance to distribution mismatch, establishing it as a reliable signal for OOD detection. To ensure tractability, we adopt a Jacobian-free subspace iteration method to estimate the leading eigenvalues using only forward evaluations of the denoiser. Empirically, EigenScore achieves SOTA performance, with up to 5% AUROC improvement over the best baseline. Notably, it remains robust in near-OOD settings such as CIFAR-10 vs CIFAR-100, where existing diffusion-based methods often fail.
