ARCH2S: Dataset, Benchmark and Challenges for Learning Exterior Architectural Structures from Point Clouds
Ka Lung Cheung, Chi Chung Lee
TL;DR
The paper addresses the lack of fine-grained exterior 3D segmentation in outdoor point clouds and privacy-driven data gaps. It introduces ARCH2S, a semantically enriched, photorealistic 3D architectural models dataset with a semantic segmentation benchmark for exterior structures, drawing from real-world Hong Kong scenes and an open landscape. The dataset preparation includes model mining from the 3DBIT00 source, UV texture mapping, and sampling approximately 5M points per scene, with train/test splits and representative baselines (convolutional and transformer-based) evaluated using $OA$ and $mIoU$. The results highlight annotation inconsistencies, the superior performance of convolution-based methods over transformers on this task, and the challenges posed by imbalanced semantic distributions, underscoring the potential impact on BIM-enabled smart-city applications.
Abstract
Precise segmentation of architectural structures provides detailed information about various building components, enhancing our understanding and interaction with our built environment. Nevertheless, existing outdoor 3D point cloud datasets have limited and detailed annotations on architectural exteriors due to privacy concerns and the expensive costs of data acquisition and annotation. To overcome this shortfall, this paper introduces a semantically-enriched, photo-realistic 3D architectural models dataset and benchmark for semantic segmentation. It features 4 different building purposes of real-world buildings as well as an open architectural landscape in Hong Kong. Each point cloud is annotated into one of 14 semantic classes.
