Image-based Outlier Synthesis With Training Data
Sudarshan Regmi
TL;DR
This work tackles the challenge of detecting out-of-distribution inputs under spurious correlations and fine-grained similarities without relying on external outlier data. It introduces ASCOOD, a two-stage framework that first synthesizes virtual outliers by perturbing invariant features using gradient attributions and then trains with a joint objective on standardized features to balance ID accuracy and uncertainty toward OOD. The method achieves state-of-the-art OOD detection across spurious, fine-grained, and conventional settings on seven datasets, including large-scale ImageNet-100, without external data. Overall, ASCOOD offers a data-efficient, practical pathway toward safer deployment of deep models in open-world scenarios by mitigating reliance on spurious cues and enhancing sensitivity to subtle distribution shifts.
Abstract
Out-of-distribution (OOD) detection is critical to ensure the safe deployment of deep learning models in critical applications. Deep learning models can often misidentify OOD samples as in-distribution (ID) samples. This vulnerability worsens in the presence of spurious correlation in the training set. Likewise, in fine-grained classification settings, detection of fine-grained OOD samples becomes inherently challenging due to their high similarity to ID samples. However, current research on OOD detection has focused instead largely on relatively easier (conventional) cases. Even the few recent works addressing these challenging cases rely on carefully curated or synthesized outliers, ultimately requiring external data. This motivates our central research question: ``Can we innovate OOD detection training framework for fine-grained and spurious settings \textbf{without requiring any external data at all?}" In this work, we present a unified \textbf{A}pproach to \textbf{S}purious, fine-grained, and \textbf{C}onventional \textbf{OOD D}etection (\textbf{\ASCOOD}) that eliminates the reliance on external data. First, we synthesize virtual outliers from ID data by approximating the destruction of invariant features. Specifically, we propose to add gradient attribution values to ID inputs to disrupt invariant features while amplifying true-class logit, thereby synthesizing challenging near-manifold virtual outliers. Then, we simultaneously incentivize ID classification and predictive uncertainty towards virtual outliers. For this, we further propose to leverage standardized features with z-score normalization. ASCOOD effectively mitigates impact of spurious correlations and encourages capturing fine-grained attributes. Extensive experiments across \textbf{7} datasets and and comparisons with \textbf{30+} methods demonstrate merit of ASCOOD in spurious, fine-grained and conventional settings.
