SFC: Shared Feature Calibration in Weakly Supervised Semantic Segmentation
Xinqiao Zhao, Feilong Tang, Xiaoyang Wang, Jimin Xiao
TL;DR
This work identifies long-tailed data as a driver of CAM miscalibration in weakly supervised semantic segmentation due to shared features across head and tail classes. It introduces Shared Feature Calibration (SFC), combining classifier weight CAM and prototype CAM with an Image Bank Re-sampling (IBR) strategy and a Multi-Scaled Distribution-Weighted (MSDW) consistency loss to balance activations and tighten CAM boundaries. The approach achieves new state-of-the-art results on PASCAL VOC 2012 and MS COCO 2014 for WSSS with only image-level labels, and ablations confirm the effectiveness of IBR, the MSDW components, and the distribution-coefficient weighting. The method offers a practical route to high-quality pseudo-labels and improved segmentation performance in long-tailed, weakly supervised settings, with code available for reproducibility.
Abstract
Image-level weakly supervised semantic segmentation has received increasing attention due to its low annotation cost. Existing methods mainly rely on Class Activation Mapping (CAM) to obtain pseudo-labels for training semantic segmentation models. In this work, we are the first to demonstrate that long-tailed distribution in training data can cause the CAM calculated through classifier weights over-activated for head classes and under-activated for tail classes due to the shared features among head- and tail- classes. This degrades pseudo-label quality and further influences final semantic segmentation performance. To address this issue, we propose a Shared Feature Calibration (SFC) method for CAM generation. Specifically, we leverage the class prototypes that carry positive shared features and propose a Multi-Scaled Distribution-Weighted (MSDW) consistency loss for narrowing the gap between the CAMs generated through classifier weights and class prototypes during training. The MSDW loss counterbalances over-activation and under-activation by calibrating the shared features in head-/tail-class classifier weights. Experimental results show that our SFC significantly improves CAM boundaries and achieves new state-of-the-art performances. The project is available at https://github.com/Barrett-python/SFC.
