Diffusion based Semantic Outlier Generation via Nuisance Awareness for Out-of-Distribution Detection
Suhee Yoon, Sanghyu Yoon, Ye Seul Sim, Sungik Choi, Kyungeun Lee, Hye-Seung Cho, Hankook Lee, Woohyung Lim
TL;DR
This work tackles the challenge of detecting near-OOD samples by generating challenging outliers from pixel-space ID images using diffusion through region-aware guidance. The core idea, SONA, employs input deformation, semantic and nuisance region masking, and a region-specific guidance term Δ_SONA to create outliers with explicit semantic discrepancies yet preserved nuisances. The training objective combines cross-entropy on ID data, outlier exposure, and mutual information minimization to encourage semantic separation while reducing nuisance leakage. Empirically, SONA delivers state-of-the-art near-OOD AUROC (≈$88.4\%$) across ImageNet-based benchmarks and demonstrates robustness to stop-timestep choices and OOD prompts, offering a practical, scalable approach to robust OOD detection.
Abstract
Out-of-distribution (OOD) detection, which determines whether a given sample is part of the in-distribution (ID), has recently shown promising results through training with synthetic OOD datasets. Nonetheless, existing methods often produce outliers that are considerably distant from the ID, showing limited efficacy for capturing subtle distinctions between ID and OOD. To address these issues, we propose a novel framework, Semantic Outlier generation via Nuisance Awareness (SONA), which notably produces challenging outliers by directly leveraging pixel-space ID samples through diffusion models. Our approach incorporates SONA guidance, providing separate control over semantic and nuisance regions of ID samples. Thereby, the generated outliers achieve two crucial properties: (i) they present explicit semantic-discrepant information, while (ii) maintaining various levels of nuisance resemblance with ID. Furthermore, the improved OOD detector training with SONA outliers facilitates learning with a focus on semantic distinctions. Extensive experiments demonstrate the effectiveness of our framework, achieving an impressive AUROC of 88% on near-OOD datasets, which surpasses the performance of baseline methods by a significant margin of approximately 6%.
