DPGLA: Bridging the Gap between Synthetic and Real Data for Unsupervised Domain Adaptation in 3D LiDAR Semantic Segmentation
Wanmeng Li, Simone Mosco, Daniel Fusaro, Alberto Pretto
TL;DR
DPGLA tackles unsupervised domain adaptation for 3D LiDAR semantic segmentation by integrating a dynamic pseudo-label filtering scheme (DPLF), a non-learned prior-guided data augmentation pipeline (PG-DAP), and a data mixing consistency loss within a Mean Teacher framework. DPLF adapts global and class-specific confidence thresholds using EMA-driven statistics to balance pseudo-label quality and class representation, while PG-DAP employs Density-Aware Sampling and jitter strategies to bridge input-level disparities without heavy computation. The data mixing consistency loss enforces context-free representations on mixed source/target samples, and LaserMix-based domain mixing further promotes robust cross-domain learning. Across two synthetic-to-real benchmarks, DPGLA achieves state-of-the-art performance and ablations show each component contributes to improved robustness and accuracy, with released code enabling reproducibility and future extensions.
Abstract
Annotating real-world LiDAR point clouds for use in intelligent autonomous systems is costly. To overcome this limitation, self-training-based Unsupervised Domain Adaptation (UDA) has been widely used to improve point cloud semantic segmentation by leveraging synthetic point cloud data. However, we argue that existing methods do not effectively utilize unlabeled data, as they either rely on predefined or fixed confidence thresholds, resulting in suboptimal performance. In this paper, we propose a Dynamic Pseudo-Label Filtering (DPLF) scheme to enhance real data utilization in point cloud UDA semantic segmentation. Additionally, we design a simple and efficient Prior-Guided Data Augmentation Pipeline (PG-DAP) to mitigate domain shift between synthetic and real-world point clouds. Finally, we utilize data mixing consistency loss to push the model to learn context-free representations. We implement and thoroughly evaluate our approach through extensive comparisons with state-of-the-art methods. Experiments on two challenging synthetic-to-real point cloud semantic segmentation tasks demonstrate that our approach achieves superior performance. Ablation studies confirm the effectiveness of the DPLF and PG-DAP modules. We release the code of our method in this paper.
