Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques

Luca Barco; Giacomo Blanco; Gaetano Chiriaco; Alessia Intini; Luigi La Riccia; Vittorio Scolamiero; Piero Boccardo; Paolo Garza; Fabrizio Dominici

Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques

Luca Barco, Giacomo Blanco, Gaetano Chiriaco, Alessia Intini, Luigi La Riccia, Vittorio Scolamiero, Piero Boccardo, Paolo Garza, Fabrizio Dominici

TL;DR

We address label scarcity in urban 3D semantic segmentation by introducing Turin3D, a high-density aerial LiDAR dataset over Turin with unlabeled training data and manually annotated validation/test sets. The study benchmarks multiple architectures under transfer learning and proposes an iterative semi-supervised pseudo-labeling strategy, demonstrating that adaptive confidence thresholds yield substantial gains over a transfer-learning baseline. RandLA-Net emerges as the most effective backbone for cross-city adaptation, and the adaptive semi-supervised approach achieves an absolute improvement of $9.76$ mIoU, highlighting the value of unlabeled data when ground truth is scarce. The dataset and findings aim to promote data-efficient learning and domain adaptation for large-scale urban LiDAR segmentation, with potential extensions in annotation, taxonomy, and modality fusion.

Abstract

3D semantic segmentation plays a critical role in urban modelling, enabling detailed understanding and mapping of city environments. In this paper, we introduce Turin3D: a new aerial LiDAR dataset for point cloud semantic segmentation covering an area of around 1.43 km2 in the city centre of Turin with almost 70M points. We describe the data collection process and compare Turin3D with others previously proposed in the literature. We did not fully annotate the dataset due to the complexity and time-consuming nature of the process; however, a manual annotation process was performed on the validation and test sets, to enable a reliable evaluation of the proposed techniques. We first benchmark the performances of several point cloud semantic segmentation models, trained on the existing datasets, when tested on Turin3D, and then improve their performances by applying a semi-supervised learning technique leveraging the unlabelled training set. The dataset will be publicly available to support research in outdoor point cloud segmentation, with particular relevance for self-supervised and semi-supervised learning approaches given the absence of ground truth annotations for the training set.

Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques

TL;DR

Abstract

Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)