SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
Weixiao Gao, Liangliang Nan, Hugo Ledoux
TL;DR
SUM Parts addresses the lack of part-level semantic annotations for urban textured meshes by introducing a large-scale dataset and an efficient annotation tool. It combines face- and texture-based labeling, ground-truth for $21$ classes across $2.5 \,\text{km}^2$, and provides benchmarks for 3D semantic segmentation and interactive annotation. The work demonstrates that mesh-texture representations paired with template-driven annotation improve labeling efficiency and segmentation performance, with PointVector achieving state-of-the-art results on face and pixel tracks. This dataset enables finer urban modeling for smart cities, BIM workflows, and digital twins, complementing existing LiDAR/mesh datasets with richer, part-level semantics.
Abstract
Semantic segmentation in urban scene analysis has mainly focused on images or point clouds, while textured meshes - offering richer spatial representation - remain underexplored. This paper introduces SUM Parts, the first large-scale dataset for urban textured meshes with part-level semantic labels, covering about 2.5 km2 with 21 classes. The dataset was created using our own annotation tool, which supports both face- and texture-based annotations with efficient interactive selection. We also provide a comprehensive evaluation of 3D semantic segmentation and interactive annotation methods on this dataset. Our project page is available at https://tudelft3d.github.io/SUMParts/.
