Synth It Like KITTI: Synthetic Data Generation for Object Detection in Driving Scenarios
Richard Marcus, Christian Vogel, Inga Jatzkowski, Niklas Knoop, Marc Stamminger
TL;DR
This work tackles the synthetic-to-real transfer problem for 3D LiDAR object detection in driving scenarios by building a CARLA-based data-generation pipeline with domain randomization and explicit sensor modeling. It demonstrates that carefully modeled LiDAR intensity, variation in environment and vehicle behavior, and bounding-box adjustments can enable a detector trained on synthetic data to generalize to KITTI, with fine-tuning on a small real-data subset nearly closing the gap. The authors provide a modular, open pipeline and multiple sensor variants to study what drives the domain gap, showing that sensor realism and randomization have a substantial impact on transfer performance. The approach is of practical use for synthetic pretraining and rapid adaptation to real-world data, and it points toward scalable, domain-robust perception in autonomous driving.
Abstract
Simulation is an important factor in advancing autonomous driving systems. Yet there has been rather little progress on transferability between the virtual and the real world. We revisit this problem for 3D object detection on LiDAR point clouds and propose a dataset generation pipeline based on the CARLA simulator. Using domain randomization strategies and careful modeling, we are able to train an object detector on the synthetic data and demonstrate strong generalization to the KITTI dataset. Furthermore, we compare different virtual sensor variants to gain insight into which sensor attributes may be responsible for the prevalent domain gap. Finally, fine-tuning with a small portion of real data almost matches the baseline, and fine-tuning with the full training set slightly surpasses it.
