BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection
Changshun Wu, Weicheng He, Chih-Hong Cheng, Xiaowei Huang, Saddek Bensalem
TL;DR
BAM introduces box abstraction monitors that non-invasively detect OoD objects in real-time object detection by enclosing in-distribution features with a finite union of convex boxes. The method constructs per-class, layer-specific TBAs by clustering high-level features and enlarging boxes to achieve a target $FPR95$, enabling robust OoD rejection without retraining or architectural changes. Empirical results on KITTI, BDD100K, and multiple OoD datasets show BAM consistently lowers $FPR95$ compared to VOS, with negligible runtime overhead on GPUs. This approach offers a practical, scalable solution for safe, real-time perception in open-world environments.
Abstract
Out-of-distribution (OoD) detection techniques for deep neural networks (DNNs) have become crucial because they filter out abnormal inputs, especially when DNNs are used in safety-critical applications and interact with an open and dynamic environment. Nevertheless, integrating OoD detection into state-of-the-art (SOTA) object detection DNNs poses significant challenges, partly because SOTA OoD construction methods are complex: they require modifying the DNN architecture and introducing complex loss functions. This paper proposes Box Abstraction-based Monitors (BAM), a simple yet surprisingly effective method that requires neither retraining nor architectural changes to the object detection DNN. The novelty of BAM stems from using a finite union of convex box abstractions to capture the learned features of objects for in-distribution (ID) data, together with the important observation that features from OoD data are more likely to fall outside these boxes. The union of convex regions within the feature space allows the formation of non-convex and interpretable decision boundaries, overcoming the limitations of VOS-like detectors without sacrificing real-time performance. Experiments integrating BAM into Faster R-CNN-based object detection DNNs demonstrate considerably improved performance over SOTA OoD detection techniques.
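To make the idea of a finite union of boxes concrete, the following is a minimal sketch in plain Python, not the authors' implementation. It assumes that the ID features of one class at one layer are already available as vectors, uses a tiny k-means as a stand-in for the clustering step, and enlarges each box by a relative margin. The function names (`kmeans`, `build_boxes`, `is_ood`) and the `enlarge` parameter are hypothetical; the calibration of the enlargement against a false-positive-rate target is not reproduced here.

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Tiny Lloyd's k-means (illustrative stand-in); returns one cluster index per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(list(points), k)]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center (squared Euclidean distance).
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Move each center to the mean of its members; keep it if the cluster is empty.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

def build_boxes(points, labels, k, enlarge=0.0):
    """One axis-aligned box per non-empty cluster, widened by `enlarge` times its side length."""
    boxes = []
    for c in range(k):
        members = [p for p, lab in zip(points, labels) if lab == c]
        if not members:
            continue
        lo = [min(col) for col in zip(*members)]
        hi = [max(col) for col in zip(*members)]
        margin = [enlarge * (h - l) for l, h in zip(lo, hi)]
        boxes.append((
            [l - m for l, m in zip(lo, margin)],
            [h + m for h, m in zip(hi, margin)],
        ))
    return boxes

def is_ood(feature, boxes):
    """A feature is flagged as OoD when it lies outside every box of the union."""
    return not any(
        all(l <= x <= h for x, l, h in zip(feature, lo, hi))
        for lo, hi in boxes
    )

# Toy 2-D "ID features" forming two well-separated groups (assumed data).
id_features = [(0, 0), (1, 0), (0, 1), (1, 1),
               (10, 10), (11, 10), (10, 11), (11, 11)]

# Union of two boxes: the region between the groups is not covered.
union_boxes = build_boxes(id_features, kmeans(id_features, k=2), k=2, enlarge=0.1)

# A single convex box around all ID features, for comparison.
single_box = build_boxes(id_features, kmeans(id_features, k=1), k=1, enlarge=0.1)

probe = (5, 5)  # lies between the two groups
print(is_ood(probe, union_boxes), is_ood(probe, single_box))
```

In this toy setting the probe point between the two groups is rejected by the union of two boxes but accepted by the single enclosing box, which illustrates how a union of convex regions can form a non-convex decision boundary. The membership test needs only per-dimension comparisons, which is consistent with the low runtime overhead claimed above.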
