Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation
Chuandong Liu, Xingxing Weng, Shuguo Jiang, Pengcheng Li, Lei Yu, Gui-Song Xia
TL;DR
This work tackles semi-supervised LiDAR semantic segmentation for driving scenes by addressing intra- and inter-scene affinity. It introduces AIScene, which uses a point erasure strategy to enforce intra-scene consistency and a patch-based augmentation pipeline (MixPatch and InsFill) to exploit inter-scene correlations across multiple scenes. The method operates within a teacher–student framework with EMA updates and a pseudo-label threshold, achieving state-of-the-art gains on SemanticKITTI and nuScenes, particularly at very low labeled data (1%). The results illustrate that both the erasure mechanism and the multi-scene augmentation independently and jointly improve learning from unlabeled data, offering practical improvements for reducing labeling demands in autonomous-driving perception systems.
Abstract
This paper explores scene affinity (AIScene), namely intra-scene consistency and inter-scene correlation, for semi-supervised LiDAR semantic segmentation in driving scenes. Adopting teacher-student training, AIScene employs a teacher network to generate pseudo-labeled scenes from unlabeled data, which then supervise the student network's learning. Unlike most methods that include all points in pseudo-labeled scenes for forward propagation but only pseudo-labeled points for backpropagation, AIScene removes points without pseudo-labels, ensuring consistency in both forward and backward propagation within the scene. This simple point erasure strategy effectively prevents unsupervised, semantically ambiguous points (excluded in backpropagation) from affecting the learning of pseudo-labeled points. Moreover, AIScene incorporates patch-based data augmentation, mixing multiple scenes at both scene and instance levels. Compared to existing augmentation techniques that typically perform scene-level mixing between two scenes, our method enhances the semantic diversity of labeled (or pseudo-labeled) scenes, thereby improving the semi-supervised performance of segmentation models. Experiments show that AIScene outperforms previous methods on two popular benchmarks across four settings, achieving notable improvements of 1.9% and 2.1% in the most challenging 1% labeled data.
