Localization and Expansion: A Decoupled Framework for Point Cloud Few-shot Semantic Segmentation
Zhaoyang Li, Yuan Wang, Wangkai Li, Rui Sun, Tianzhu Zhang
TL;DR
This paper tackles point-cloud few-shot semantic segmentation (PC-FSS) by identifying the limitations of direct point-level prototype matching, especially background confusion and intra-class diversity. It introduces Decoupled Localization and Expansion (DLE), comprising a Structural Localization Module that uses semantically aware agents for distribution-level matching and a Self-Expansion Module that expands localized regions using intra-object query cues with a conservative consistency check. The approach yields substantial performance gains on S3DIS and ScanNet across 1-shot and few-shot settings, demonstrating improved target localization and more complete foreground excavation. By combining structure-aware matching with query-driven expansion, DLE provides robust, data-efficient segmentation in complex 3D scenes.
Abstract
Point cloud few-shot semantic segmentation (PC-FSS) aims to segment targets of novel categories in a given query point cloud with only a few annotated support samples. The current top-performing prototypical learning methods employ prototypes originating from support samples to direct the classification of query points. However, the inherent fragility of point-level matching and the prevalent intra-class diversity pose great challenges to this cross-instance matching paradigm, leading to erroneous background activations or incomplete target excavation. In this work, we propose a simple yet effective framework in the spirit of Decoupled Localization and Expansion (DLE). The proposed DLE, including a structural localization module (SLM) and a self-expansion module (SEM), enjoys several merits. First, structural information is injected into the matching process through the agent-level correlation in SLM, and the confident target region can thus be precisely located. Second, more reliable intra-object similarity is harnessed in SEM to derive the complete target, and the conservative expansion strategy is introduced to reasonably constrain the expansion. Extensive experiments on two challenging benchmarks under different settings demonstrate that DLE outperforms previous state-of-the-art approaches by large margins.
