Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach
Mathieu Cocheteux, Julien Moreau, Franck Davoine
TL;DR
This work tackles the problem of quantifying and leveraging uncertainty in online extrinsic calibration for multi-sensor fusion. It proposes a pipeline that combines Monte Carlo Dropout for model uncertainty with Conformal Prediction to produce prediction intervals that guarantee coverage at a specified level, while remaining compatible with existing calibration models. Key contributions include a CP-based framework tailored to online extrinsic calibration, a practical integration with deep learning backbones, and extensive validation on KITTI and DSEC across RGB-LiDAR and Event-LiDAR modalities, along with ablations that highlight the necessity of CP over MCD alone and the impact of sensor resolution. The approach enhances robustness and interpretability of calibration in dynamic environments, enabling decisions such as triggering recalibration or adjusting sensor-fusion confidence, which is critical for safe and reliable autonomous systems.
Abstract
Accurate sensor calibration is crucial for autonomous systems, yet its uncertainty quantification remains underexplored. We present the first approach to integrate uncertainty awareness into online extrinsic calibration, combining Monte Carlo Dropout with Conformal Prediction to generate prediction intervals with a guaranteed level of coverage. Our method proposes a framework to enhance existing calibration models with uncertainty quantification, compatible with various network architectures. Validated on KITTI (RGB Camera-LiDAR) and DSEC (Event Camera-LiDAR) datasets, we demonstrate effectiveness across different visual sensor types, measuring performance with adapted metrics to evaluate the efficiency and reliability of the intervals. By providing calibration parameters with quantifiable confidence measures, we offer insights into the reliability of calibration estimates, which can greatly improve the robustness of sensor fusion in dynamic environments and usefully serve the Computer Vision community.
