Beyond Pixels: Semi-Supervised Semantic Segmentation with a Multi-scale Patch-based Multi-Label Classifier
Prantik Howlader, Srijan Das, Hieu Le, Dimitris Samaras
TL;DR
This work addresses the limited contextual information in semi-supervised semantic segmentation by introducing a plug-in Multi-scale Patch-based Multi-label Classifier (MPMC) that provides patch-level supervision across multiple scales. MPMC also derives adaptive per-patch weights from patch confidences to mitigate the impact of noisy teacher pseudo-labels within a teacher–student framework, enabling joint training of the segmentation network and the MPMC module. Evaluations on Cityscapes, Pascal VOC, and ACDC show consistent improvements across four SSS baselines, with especially strong gains in low-label settings and across both natural and medical datasets. By enforcing explicit patch-context preservation and selective weighting, MPMC enhances discrimination between neighboring classes and supports more reliable learning from unlabeled data.
Abstract
Incorporating pixel contextual information is critical for accurate segmentation. In this paper, we show that an effective way to incorporate contextual information is through a patch-based classifier. This patch classifier is trained to identify classes present within an image region, which facilitates the elimination of distractors and enhances the classification of small object segments. Specifically, we introduce Multi-scale Patch-based Multi-label Classifier (MPMC), a novel plug-in module designed for existing semi-supervised segmentation (SSS) frameworks. MPMC offers patch-level supervision, enabling the discrimination of pixel regions of different classes within a patch. Furthermore, MPMC learns an adaptive pseudo-label weight, using patch-level classification to alleviate the impact of the teacher's noisy pseudo-label supervision the student. This lightweight module can be integrated into any SSS framework, significantly enhancing their performance. We demonstrate the efficacy of our proposed MPMC by integrating it into four SSS methodologies and improving them across two natural image and one medical segmentation dataset, notably improving the segmentation results of the baselines across all the three datasets.
