Few-Shot Segmentation with Global and Local Contrastive Learning
Weide Liu, Zhonghua Wu, Henghui Ding, Fayao Liu, Jie Lin, Guosheng Lin, Wei Zhou
TL;DR
This work tackles few-shot segmentation by decoupling query information from the support guidance and extracting priors directly from unlabeled query images using a global-local contrastive learning framework. A prior extractor produces query priors, which, together with a cross-correspondence module that fuses support-guided cues, enables effective query-mask prediction. The approach yields state-of-the-art results on both PASCAL-5i and MS COCO and is supported by extensive ablations showing the benefits of local patch-based contrastive learning and SLIC-based patch generation. The method offers practical impact by reducing reliance on labeled support and enhancing generalization to novel classes in segmentation tasks.
Abstract
In this work, we address the challenging task of few-shot segmentation. Previous few-shot segmentation methods mainly employ the information of support images as guidance for query image segmentation. Although some works propose to build cross-reference between support and query images, their extraction of query information still depends on the support images. We here propose to extract the information from the query itself independently to benefit the few-shot segmentation task. To this end, we first propose a prior extractor to learn the query information from the unlabeled images with our proposed global-local contrastive learning. Then, we extract a set of predetermined priors via this prior extractor. With the obtained priors, we generate the prior region maps for query images, which locate the objects, as guidance to perform cross interaction with support features. In such a way, the extraction of query information is detached from the support branch, overcoming the limitation by support, and could obtain more informative query clues to achieve better interaction. Without bells and whistles, the proposed approach achieves new state-of-the-art performance for the few-shot segmentation task on PASCAL-5$^{i}$ and COCO datasets.
