3DGS-Calib: 3D Gaussian Splatting for Multimodal SpatioTemporal Calibration
Quentin Herau, Moussab Bennehar, Arthur Moreau, Nathan Piasco, Luis Roldao, Dzmitry Tsishkou, Cyrille Migniot, Pascal Vasseur, Cédric Demonceaux
TL;DR
The paper tackles fast, accurate multimodal spatiotemporal calibration for LiDAR and cameras by replacing slow NeRF-based representations with 3D Gaussian Splatting. It introduces 3DGS-Calib, which uses LiDAR-derived Gaussians as a fixed geometric scaffold and a shared MLP to predict per-Gaussian parameters, optimized via a differentiable photometric loss that aligns multi-sensor data. Through preprocessing tricks like accumulated LiDAR downsampling, progressive voxelization, image cropping, and scale regularization, it achieves substantial speed-ups while maintaining or improving accuracy, as demonstrated on KITTI-360 sequences. The approach outperforms NeRF-based methods and classical baselines in both spatiotemporal and LiDAR-camera calibration tasks, highlighting its practical potential for online, in-the-wild sensor fusion.
Abstract
Reliable multimodal sensor fusion algorithms require accurate spatiotemporal calibration. Recently, targetless calibration techniques based on implicit neural representations have proven to provide precise and robust results. Nevertheless, such methods are inherently slow to train given the high computational overhead caused by the large number of sampled points required for volume rendering. With the recent introduction of 3D Gaussian Splatting as a faster alternative to implicit representation methods, we propose to leverage this new rendering approach to achieve faster multi-sensor calibration. We introduce 3DGS-Calib, a new calibration method that relies on the speed and rendering accuracy of 3D Gaussian Splatting to achieve multimodal spatiotemporal calibration that is accurate, robust, and with a substantial speed-up compared to methods relying on implicit neural representations. We demonstrate the superiority of our proposal with experimental results on sequences from KITTI-360, a widely used driving dataset.
