FlowCon: Out-of-Distribution Detection using Flow-Based Contrastive Learning
Saandeep Aathreya, Shaun Canavan
TL;DR
FlowCon tackles OOD detection by marrying normalizing flows with a supervised contrastive loss to learn class-conditioned densities on penultimate classifier features. The core idea is to maximize the flow-based likelihood while a Bhattacharyya-inspired contrastive loss enforces tight, class-specific clusters, yielding robust in-distribution coverage and clear separation from out-of-distribution samples. The approach trains a single flow model on fixed classifier features, avoiding retraining or external OOD data, and computes OOD scores from per-class density estimates, achieving strong performance across far-OOD, near-OOD, and mixed scenarios on CIFAR-10/100 with ResNet18 and WideResNet backbones. Empirical results, likelihood histograms, and UMAP visualizations demonstrate FlowCon’s discriminative density structure, class-preserving behavior, and practical viability for real-world deployment in vision tasks.
Abstract
Identifying Out-of-distribution (OOD) data is becoming increasingly critical as the real-world applications of deep learning methods expand. Post-hoc methods modify softmax scores fine-tuned on outlier data or leverage intermediate feature layers to identify distinctive patterns between In-Distribution (ID) and OOD samples. Other methods focus on employing diverse OOD samples to learn discrepancies between ID and OOD. These techniques, however, are typically dependent on the quality of the outlier samples assumed. Density-based methods explicitly model class-conditioned distributions but this requires long training time or retraining the classifier. To tackle these issues, we introduce \textit{FlowCon}, a new density-based OOD detection technique. Our main innovation lies in efficiently combining the properties of normalizing flow with supervised contrastive learning, ensuring robust representation learning with tractable density estimation. Empirical evaluation shows the enhanced performance of our method across common vision datasets such as CIFAR-10 and CIFAR-100 pretrained on ResNet18 and WideResNet classifiers. We also perform quantitative analysis using likelihood plots and qualitative visualization using UMAP embeddings and demonstrate the robustness of the proposed method under various OOD contexts. Code will be open-sourced post decision.
