All for One, and One for All: UrbanSyn Dataset, the third Musketeer of Synthetic Driving Scenes
Jose L. Gómez, Manuel Silva, Antonio Seoane, Agnès Borrás, Mario Noriega, Germán Ros, Jose A. Iglesias-Guitian, Antonio M. López
TL;DR
UrbanSyn tackles the data-labeling bottleneck in autonomous driving by delivering photorealistic synthetic urban scenes with pixel-perfect ground truth, including depth, semantics, and instances. Generated with unbiased path tracing and GIS-informed layouts, UrbanSyn complements GTAV and Synscapes as part of The Three Musketeers to close the synth-to-real gap via multisource unsupervised domain adaptation. Across Cityscapes, BDD100K, and Mapillary Vistas, UrbanSyn strengthens semantic segmentation baselines and, when combined with the other datasets, achieves state-of-the-art synth-to-real UDA performance using HRDA and co-training. The dataset is openly accessible, enabling broad reuse for tasks such as instance segmentation and depth estimation, and guiding future work in active learning and data-centric AI for autonomous driving.
Abstract
We introduce UrbanSyn, a photorealistic dataset acquired through semi-procedurally generated synthetic urban driving scenarios. Developed using high-quality geometry and materials, UrbanSyn provides pixel-level ground truth, including depth, semantic segmentation, and instance segmentation with object bounding boxes and occlusion degree. It complements GTAV and Synscapes datasets to form what we coin as the 'Three Musketeers'. We demonstrate the value of the Three Musketeers in unsupervised domain adaptation for image semantic segmentation. Results on real-world datasets, Cityscapes, Mapillary Vistas, and BDD100K, establish new benchmarks, largely attributed to UrbanSyn. We make UrbanSyn openly and freely accessible (www.urbansyn.org).
