Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction
Alexander Timans, Christoph-Nikolas Straehle, Kaspar Sakmann, Eric Nalisnick
TL;DR
This work targets principled uncertainty quantification for multi-object detection by leveraging conformal prediction to produce per-object bounding-box intervals with guaranteed coverage at level $1-\alpha$. It introduces a two-step conformal pipeline that first constructs class-label prediction sets and then builds adaptive, per-coordinate bounding-box intervals, propagating class uncertainty to the box estimates. The method employs ensemble and quantile-based scoring (Box-Std, Box-Ens, Box-CQR) and a max-rank multiple-testing correction to maintain coverage while balancing interval width across object sizes, demonstrated on COCO, Cityscapes, and BDD100k with real-world detectors. The approach yields valid, actionable uncertainty estimates even when the detector misclassifies objects, enabling safer decisions in autonomy and robotics.
Abstract
Quantifying a model's predictive uncertainty is essential for safety-critical applications such as autonomous driving. We consider quantifying such uncertainty for multi-object detection. In particular, we leverage conformal prediction to obtain uncertainty intervals with guaranteed coverage for object bounding boxes. One challenge in doing so is that bounding box predictions are conditioned on the object's class label. Thus, we develop a novel two-step conformal approach that propagates uncertainty in predicted class labels into the uncertainty intervals of bounding boxes. This broadens the validity of our conformal coverage guarantees to include incorrectly classified objects, thus offering more actionable safety assurances. Moreover, we investigate novel ensemble and quantile regression formulations to ensure the bounding box intervals are adaptive to object size, leading to a more balanced coverage. Validating our two-step approach on real-world datasets for 2D bounding box localization, we find that desired coverage levels are satisfied with practically tight predictive uncertainty intervals.
