Environment-Driven Online LiDAR-Camera Extrinsic Calibration
Zhiwei Huang, Jiaqi Li, Hongbo Zhao, Xiao Ma, Ping Zhong, Xiaohu Zhou, Wei Ye, Rui Fan
TL;DR
This work tackles online LiDAR-Camera extrinsic calibration in unstructured environments by proposing EdO-LCEC, an environment-driven framework that adapts to scene feature density. A generalizable scene discriminator creates multiple virtual cameras to enrich cross-modal features, while Dual-Path Correspondence Matching (DPCM) leverages both structural and textural cues to generate dense 3D-2D correspondences. These correspondences are fused through a multi-view and multi-scene optimization to robustly estimate the extrinsic transform $^{C}_{L}oldsymbol{T}$, improving accuracy in sparse and limited overlap conditions. Extensive experiments on KITTI, KITTI360, nuScenes, and MIAS-LCEC demonstrate state-of-the-art performance and strong robustness, with ablations confirming the contributions of the scene discriminator and DPCM. The approach offers practical impact for reliable multi-modal fusion in autonomous systems operating in diverse real-world environments.
Abstract
LiDAR-camera extrinsic calibration (LCEC) is crucial for multi-modal data fusion in autonomous robotic systems. Existing methods, whether target-based or target-free, typically rely on customized calibration targets or fixed scene types, which limit their applicability in real-world scenarios. To address these challenges, we present EdO-LCEC, the first environment-driven online calibration approach. Unlike traditional target-free methods, EdO-LCEC employs a generalizable scene discriminator to estimate the feature density of the application environment. Guided by this feature density, EdO-LCEC extracts LiDAR intensity and depth features from varying perspectives to achieve higher calibration accuracy. To overcome the challenges of cross-modal feature matching between LiDAR and camera, we introduce dual-path correspondence matching (DPCM), which leverages both structural and textural consistency for reliable 3D-2D correspondences. Furthermore, we formulate the calibration process as a joint optimization problem that integrates global constraints across multiple views and scenes, thereby enhancing overall accuracy. Extensive experiments on real-world datasets demonstrate that EdO-LCEC outperforms state-of-the-art methods, particularly in scenarios involving sparse point clouds or partially overlapping sensor views.
