Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds
Jiacheng Wei, Guosheng Lin, Kim-Hui Yap, Fayao Liu, Tzu-Yi Hung
TL;DR
This work tackles 3D point cloud semantic segmentation under weak supervision by densely propagating sparse labels through cross-sample and intra-sample feature reallocating. It introduces a two-stage training framework: first, Cross-Sample Feature Reallocating (CSFR) transfers features and gradients across input pairs, then Intra-Sample Feature Reallocating (ISFR) propagates supervision within each sample, with corresponding cross- and self-regularization losses. By leveraging a KPConv-based backbone and decoupling the two modules during training, the method achieves performance close to fully supervised baselines on S3DIS and ScanNet using only 10% or 1% labeled points. The approach significantly reduces annotation costs while maintaining practical accuracy, offering a scalable path for dense 3D segmentation in real-world scenes.
Abstract
Semantic segmentation on 3D point clouds is an important task for 3D scene understanding. While dense labeling on 3D data is expensive and time-consuming, only a few works address weakly supervised semantic point cloud segmentation methods to relieve the labeling cost by learning from simpler and cheaper labels. Meanwhile, there are still huge performance gaps between existing weakly supervised methods and state-of-the-art fully supervised methods. In this paper, we train a semantic point cloud segmentation network with only a small portion of points being labeled. We argue that we can better utilize the limited supervision information as we densely propagate the supervision signal from the labeled points to other points within and across the input samples. Specifically, we propose a cross-sample feature reallocating module to transfer similar features and therefore re-route the gradients across two samples with common classes and an intra-sample feature redistribution module to propagate supervision signals on unlabeled points across and within point cloud samples. We conduct extensive experiments on public datasets S3DIS and ScanNet. Our weakly supervised method with only 10% and 1% of labels can produce compatible results with the fully supervised counterpart.
