Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy
Will LeVine, Benjamin Pikus, Jacob Phillips, Berk Norman, Fernando Amat Gil, Sean Hendryx
TL;DR
The paper introduces Ablated Learned Temperature Energy (AbeT), an OOD detection method that replaces a single scalar temperature with an input-dependent learned temperature and removes the Forefront Temperature Constant from the energy score. AbeT achieves state-of-the-art or competitive results across classification, semantic segmentation, and object detection, notably reducing FPR@95 and increasing AUROC/AUPRC on standard OOD benchmarks. The authors provide empirical and visual evidence that leveraging misclassified ID examples during training helps AbeT distinguish ID from OOD without using explicit OOD data. The approach is lightweight, requiring only a small architectural change (a learned temperature module) and a targeted ablation, with potential for extension to large-language models and vision-language models.
Abstract
As deep neural networks become adopted in high-stakes domains, it is crucial to identify when inference inputs are Out-of-Distribution (OOD) so that users can be alerted of likely drops in performance and calibration despite high confidence -- ultimately to know when networks' decisions (and their uncertainty in those decisions) should be trusted. In this paper we introduce Ablated Learned Temperature Energy (or "AbeT" for short), an OOD detection method which lowers the False Positive Rate at 95\% True Positive Rate (FPR@95) by $43.43\%$ in classification compared to state of the art without training networks in multiple stages or requiring hyperparameters or test-time backward passes. We additionally provide empirical insights as to why our model learns to distinguish between In-Distribution (ID) and OOD samples while only being explicitly trained on ID samples via exposure to misclassified ID examples at training time. Lastly, we show the efficacy of our method in identifying predicted bounding boxes and pixels corresponding to OOD objects in object detection and semantic segmentation, respectively -- with an AUROC increase of $5.15\%$ in object detection and both a decrease in FPR@95 of $41.48\%$ and an increase in AUPRC of $34.20\%$ in semantic segmentation compared to previous state of the art.
