MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
Hengyuan Zhang, David Paz, Yuliang Guo, Xinyu Huang, Henrik I. Christensen, Liu Ren
TL;DR
The paper addresses cross-sensor generalization in online mapping for autonomous driving by introducing MapGS, a data-generation framework that uses Gaussian splatting to reconstruct scenes and render images in a target sensor configuration. By creating nuAV2—reconstructed AV2 data rendered into nuScenes views—and leveraging pretraining and joint training, the approach improves cross-configuration generalization, accelerates training, and reduces labeling needs, achieving notable gains even with limited target-domain data. The key contributions include the nuAV2 dataset, a data-regeneration recipe, and demonstrated improvements (e.g., an 18% performance boost and the ability to surpass Oracle performance with only 25% of target data). This work enables data reuse across sensor setups, offering a practical path toward scalable, surround-view online mapping with reduced labeling burden.
Abstract
Online mapping reduces the reliance of autonomous vehicles on high-definition (HD) maps, significantly enhancing scalability. However, recent advancements often overlook cross-sensor configuration generalization, leading to performance degradation when models are deployed on vehicles with different camera intrinsics and extrinsics. With the rapid evolution of novel view synthesis methods, we investigate the extent to which these techniques can be leveraged to address the sensor configuration generalization challenge. We propose a novel framework leveraging Gaussian splatting to reconstruct scenes and render camera images in target sensor configurations. The target config sensor data, along with labels mapped to the target config, are used to train online mapping models. Our proposed framework on the nuScenes and Argoverse 2 datasets demonstrates a performance improvement of 18% through effective dataset augmentation, achieves faster convergence and efficient training, and exceeds state-of-the-art performance when using only 25% of the original training data. This enables data reuse and reduces the need for laborious data labeling. Project page at https://henryzhangzhy.github.io/mapgs.
