Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation
Jie Liu, Wenzhe Yin, Haochen Wang, Yunlu CHen, Jan-Jakob Sonke, Efstratios Gavves
TL;DR
Dynamic Prototype Adaptation (DPA) addresses the mismatch between support prototypes and query features in few-shot point cloud segmentation by learning task-specific prototypes for each query. It leverages three components—prototype rectification, prototype-to-query attention, and prototype distillation—to adapt vanilla support prototypes into query-specific representations, enabling accurate per-point masks via a non-parametric metric that compares per-point features to prototypes. Evaluated on the S3DIS and ScanNet benchmarks under the $N$-way $K$-shot setting, DPA delivers state-of-the-art performance, exemplified by improvements of $7.43\%$ on S3DIS and $6.39\%$ on ScanNet in the $2$-way $1$-shot setting over prior methods. The results demonstrate improved generalization to unseen classes and object variations with an end-to-end framework that remains practically efficient.
Abstract
Few-shot point cloud segmentation seeks to generate per-point masks for previously unseen categories, using only a minimal set of annotated point clouds as reference. Existing prototype-based methods rely on support prototypes to guide the segmentation of query point clouds, but they encounter challenges when significant object variations exist between the support prototypes and query features. In this work, we present dynamic prototype adaptation (DPA), which explicitly learns task-specific prototypes for each query point cloud to tackle the object variation problem. DPA achieves the adaptation through prototype rectification, aligning vanilla prototypes from support with the query feature distribution, and prototype-to-query attention, extracting task-specific context from query point clouds. Furthermore, we introduce a prototype distillation regularization term, enabling knowledge transfer between early-stage prototypes and their deeper counterparts during adaption. By iteratively applying these adaptations, we generate task-specific prototypes for accurate mask predictions on query point clouds. Extensive experiments on two popular benchmarks show that DPA surpasses state-of-the-art methods by a significant margin, e.g., 7.43\% and 6.39\% under the 2-way 1-shot setting on S3DIS and ScanNet, respectively. Code is available at https://github.com/jliu4ai/DPA.
