P2Seg: Pointly-supervised Segmentation via Mutual Distillation
Zipeng Wang, Xuehui Yu, Xumeng Han, Wenwen Yu, Zhixun Huang, Jianbin Jiao, Zhenjun Han
TL;DR
This work tackles point-level supervised instance segmentation by introducing a Mutual Distillation Module (MDM) that jointly leverages semantic boundaries and instance-level cues. Through two mutually reinforcing branches, Semantic to Instance (S2I) and Instance to Semantic (I2S), the method transfers information in both directions to produce high-quality instance maps without pre-trained proposals. The approach achieves state-of-the-art performance on VOC 2012 and competitive results on COCO for PSIS, with notable gains from ablative studies confirming the contributions of S2I and I2S. By effectively fusing semantic and instance information, the method reduces annotation costs while enhancing boundary precision and intra-class discrimination, signaling practical impact for scalable segmentation.
Abstract
Point-level Supervised Instance Segmentation (PSIS) aims to enhance the applicability and scalability of instance segmentation by utilizing low-cost yet instance-informative annotations. Existing PSIS methods usually rely on positional information to distinguish objects, but predicting precise boundaries remains challenging due to the lack of contour annotations. Nevertheless, weakly supervised semantic segmentation methods are proficient in utilizing intra-class feature consistency to capture the boundary contours of the same semantic regions. In this paper, we design a Mutual Distillation Module (MDM) to leverage the complementary strengths of both instance position and semantic information and achieve accurate instance-level object perception. The MDM consists of Semantic to Instance (S2I) and Instance to Semantic (I2S). S2I is guided by the precise boundaries of semantic regions to learn the association between annotated points and instance contours. I2S leverages discriminative relationships between instances to facilitate the differentiation of various objects within the semantic map. Extensive experiments substantiate the efficacy of MDM in fostering the synergy between instance and semantic information, consequently improving the quality of instance-level object representations. Our method achieves 55.7 mAP$_{50}$ and 17.6 mAP on the PASCAL VOC and MS COCO datasets, significantly outperforming recent PSIS methods and several box-supervised instance segmentation competitors.
