Novel class discovery meets foundation models for 3D semantic segmentation
Luigi Riz, Cristiano Saltori, Yiming Wang, Elisa Ricci, Fabio Poiesi
TL;DR
This work addresses Novel Class Discovery for 3D point cloud semantic segmentation, a task challenging due to multiple novel classes per scene and 3D data sparsity. It introduces SNOPS, which combines online prototype-based pseudo-labelling, uncertainty-aware training, a class-balanced queue, and semantic distillation from a 3D foundation model to jointly learn base and novel classes. Compared to adapted 2D NCD baselines and zero-shot OpenScene prompts, SNOPS delivers substantial improvements across SemanticPOSS, SemanticKITTI, and S3DIS, supported by an explicit evaluation protocol. The approach demonstrates how semantically aligned feature spaces and online clustering can yield robust open-set performance in 3D semantic segmentation, with practical implications for scalable scene understanding. Future work includes relaxing the assumption of a known number of novel classes and addressing distillation-induced domain gaps.
Abstract
The task of Novel Class Discovery (NCD) in semantic segmentation entails training a model able to accurately segment unlabelled (novel) classes, relying on the available supervision from annotated (base) classes. Although extensively investigated in 2D image data, the extension of the NCD task to the domain of 3D point clouds represents a pioneering effort, characterized by assumptions and challenges that are not present in the 2D case. This paper represents an advancement in the analysis of point cloud data in four directions. Firstly, it introduces the novel task of NCD for point cloud semantic segmentation. Secondly, it demonstrates that directly transposing the only existing NCD method for 2D image semantic segmentation to 3D data yields suboptimal results. Thirdly, a new NCD approach based on online clustering, uncertainty estimation, and semantic distillation is presented. Lastly, a novel evaluation protocol is proposed to rigorously assess the performance of NCD in point cloud semantic segmentation. Through comprehensive evaluations on the SemanticKITTI, SemanticPOSS, and S3DIS datasets, the paper demonstrates substantial superiority of the proposed method over the considered baselines.
