S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon, Heeseong Shin, Eunbeen Hong, Hyunwook Choi, Hansang Cho, Daun Jeong, Seungryong Kim
TL;DR
S^4M tackles semi-supervised instance segmentation under limited labeling by integrating SAM into a teacher–student framework through three core components: structural distillation to imprint SAM’s fine-grained localization into the teacher, pseudo-label refinement to improve labeling quality on unlabeled data, and instance-aware augmentation (ARP) to generate diverse, realistic training samples. The method carefully leverages SAM’s strengths while mitigating its class-agnostic tendencies, yielding state-of-the-art results on Cityscapes and COCO at very low label ratios. Extensive ablations show that distilling decoder-based self-similarity, refining pseudo-labels, and combining ARP with a strong teacher produce the largest gains, with additional insights into when and how to apply each component. Overall, S^4M demonstrates that SAM can significantly enhance semi-supervised instance segmentation when integrated with targeted distillation and augmentation strategies, offering practical improvements for data-scarce scenarios.
Abstract
Semi-supervised instance segmentation poses challenges due to limited labeled data, causing difficulties in accurately localizing distinct object instances. Current teacher-student frameworks still suffer from performance constraints due to unreliable pseudo-label quality stemming from limited labeled data. While the Segment Anything Model (SAM) offers robust segmentation capabilities at various granularities, directly applying SAM to this task introduces challenges such as class-agnostic predictions and potential over-segmentation. To address these complexities, we carefully integrate SAM into the semi-supervised instance segmentation framework, developing a novel distillation method that effectively captures the precise localization capabilities of SAM without compromising semantic recognition. Furthermore, we incorporate pseudo-label refinement as well as a specialized data augmentation with the refined pseudo-labels, resulting in superior performance. We establish state-of-the-art performance, and provide comprehensive experiments and ablation studies to validate the effectiveness of our proposed approach.
