TrueCity: Real and Simulated Urban Data for Cross-Domain 3D Scene Understanding
Duc Nguyen, Yan-Ling Lai, Qilin Zhang, Prabin Gyawali, Benedikt Schwab, Olaf Wysocki, Thomas H. Kolbe
TL;DR
TrueCity provides a first-of-its-kind benchmark with cm-accurate real-world LiDAR, a CityGML-aligned semantic city model, and synchronized synthetic LiDAR for the same urban area to enable direct quantification of the sim-to-real domain gap in 3D semantic segmentation. By evaluating a broad set of baselines across synthetic-real data mixtures, the work reveals a pronounced, class-dependent domain shift and shows that a balanced mix of synthetic and real data can enhance performance for transformer-based architectures, while locality-based methods rely more on real data. The dataset aligns with international standards to facilitate downstream integration into CityGML/OpenDRIVE workflows and reveals practical insights, such as certain large, regular classes exhibiting minimal domain gaps and several fine-grained classes requiring real data for robust segmentation. Limitations include the absence of radiometric information and dynamic objects, motivating future work to incorporate lighting/material effects and moving scene data to broaden sim-to-real gap analyses and generalizability.
Abstract
3D semantic scene understanding remains a long-standing challenge in the 3D computer vision community. One of the key issues pertains to limited real-world annotated data to facilitate generalizable models. The common practice to tackle this issue is to simulate new data. Although synthetic datasets offer scalability and perfect labels, their designer-crafted scenes fail to capture real-world complexity and sensor noise, resulting in a synthetic-to-real domain gap. Moreover, no benchmark provides synchronized real and simulated point clouds for segmentation-oriented domain shift analysis. We introduce TrueCity, the first urban semantic segmentation benchmark with cm-accurate annotated real-world point clouds, semantic 3D city models, and annotated simulated point clouds representing the same city. TrueCity proposes segmentation classes aligned with international 3D city modeling standards, enabling consistent evaluation of synthetic-to-real gap. Our extensive experiments on common baselines quantify domain shift and highlight strategies for exploiting synthetic data to enhance real-world 3D scene understanding. We are convinced that the TrueCity dataset will foster further development of sim-to-real gap quantification and enable generalizable data-driven models. The data, code, and 3D models are available online: https://tum-gis.github.io/TrueCity/
