Semi-Supervised Semantic Segmentation via Marginal Contextual Information
Moshe Kimhi, Shai Kimhi, Evgenii Zheltonozhskii, Or Litany, Chaim Baskin
TL;DR
This work targets efficient semi-supervised semantic segmentation by addressing confirmation bias in pseudo labeling. It introduces S4MC, a teacher–student framework that employs dynamic margin-based pseudo labeling and a Marginal Contextual Information refinement to leverage neighboring pixel predictions. Across VOC, Cityscapes, and COCO, S4MC achieves state-of-the-art gains in low-label settings (e.g., +1.39 mIoU on VOC 12 with 366 labeled images) and demonstrates improved label quality alongside increased unlabeled data usage. The approach combines a novel confidence refinement module, dynamic thresholding, and thorough ablations to show robustness to neighborhood design and parameter settings, offering a practical path to reduce annotation costs in dense prediction tasks.
Abstract
We present a novel confidence refinement scheme that enhances pseudo labels in semi-supervised semantic segmentation. Unlike existing methods, which filter pixels with low-confidence predictions in isolation, our approach leverages the spatial correlation of labels in segmentation maps by grouping neighboring pixels and considering their pseudo labels collectively. With this contextual information, our method, named S4MC, increases the amount of unlabeled data used during training while maintaining the quality of the pseudo labels, all with negligible computational overhead. Through extensive experiments on standard benchmarks, we demonstrate that S4MC outperforms existing state-of-the-art semi-supervised learning approaches, offering a promising solution for reducing the cost of acquiring dense annotations. For example, S4MC achieves a 1.39 mIoU improvement over the prior art on PASCAL VOC 12 with 366 annotated images. The code to reproduce our experiments is available at https://s4mcontext.github.io/
