Hunting Attributes: Context Prototype-Aware Learning for Weakly Supervised Semantic Segmentation
Feilong Tang, Zhongxing Xu, Zhaojun Qu, Wei Feng, Xingjian Jiang, Zongyuan Ge
TL;DR
Weakly supervised semantic segmentation often yields incomplete localization due to a knowledge bias between instance features and contextual prototypes. CPAL introduces context prototype aware learning that builds a dense bank of context prototypes, anchors instance prototypes, and uses soft positive neighbor selection with feature distribution alignment to produce more complete CAMs, trained with a unified BCE plus self-supervised loss. The approach provides a prototype aware framework with context prototypes, shifting aligned features, and a PACAM self-supervised objective, validated on VOC 2012 and COCO 2014, achieving state-of-the-art or competitive results when plugged into multiple baselines. Overall, CPAL improves object localization and pseudo-label quality for WSSS, enabling stronger segmentation pipelines with broader applicability.
Abstract
Recent weakly supervised semantic segmentation (WSSS) methods strive to incorporate contextual knowledge to improve the completeness of class activation maps (CAM). In this work, we argue that the knowledge bias between instances and contexts affects the capability of the prototype to sufficiently understand instance semantics. Inspired by prototype learning theory, we propose leveraging prototype awareness to capture diverse and fine-grained feature attributes of instances. The hypothesis is that contextual prototypes might erroneously activate similar and frequently co-occurring object categories due to this knowledge bias. Therefore, we propose to enhance the prototype representation ability by mitigating the bias to better capture spatial coverage in semantic object regions. With this goal, we present a Context Prototype-Aware Learning (CPAL) strategy, which leverages semantic context to enrich instance comprehension. The core of this method is to accurately capture intra-class variations in object features through context-aware prototypes, facilitating the adaptation to the semantic attributes of various instances. We design feature distribution alignment to optimize prototype awareness, aligning instance feature distributions with dense features. In addition, a unified training framework is proposed to combine label-guided classification supervision and prototypes-guided self-supervision. Experimental results on PASCAL VOC 2012 and MS COCO 2014 show that CPAL significantly improves off-the-shelf methods and achieves state-of-the-art performance. The project is available at https://github.com/Barrett-python/CPAL.
