Is Foreground Prototype Sufficient? Few-Shot Medical Image Segmentation with Background-Fused Prototype
Song Tang, Chunxiao Zu, Wenxin Su, Yuan Dong, Mao Ye, Yan Gan, Xiatian Zhu
TL;DR
The paper tackles the problem of few-shot medical image segmentation where foreground-only prototypes struggle due to concentrated frequency content and overlapping background features. It introduces Background-fused Prototype (Bro), a plug-in framework that combines Feature Similarity Calibration (FeaC) and Hierarchical Channel-Adversarial Attention (HiCA) to fuse background information into segmentation prototypes within a $N$-way $K$-shot meta-learning setting, optimized with $\\mathcal{L}_{seg}$, $\\mathcal{L}_{reg}$, and $\\beta \\\mathcal{L}_{adv}$. Empirical results on ABD-CT, ABD-MRI, and CMR demonstrate consistent Dice-score gains over state-of-the-art prototypical FSS methods, validating the importance of explicit background modeling in medical images. The work provides a practical, plug-in improvement for existing FSS models, enabling more accurate clinical segmentation with limited annotations and offering insights into background representation analysis through both qualitative and quantitative evaluations.
Abstract
Few-shot Semantic Segmentation(FSS)aim to adapt a pre-trained model to new classes with as few as a single labeled training sample per class. The existing prototypical work used in natural image scenarios biasedly focus on capturing foreground's discrimination while employing a simplistic representation for background, grounded on the inherent observation separation between foreground and background. However, this paradigm is not applicable to medical images where the foreground and background share numerous visual features, necessitating a more detailed description for background. In this paper, we present a new pluggable Background-fused prototype(Bro)approach for FSS in medical images. Instead of finding a commonality of background subjects in support image, Bro incorporates this background with two pivot designs. Specifically, Feature Similarity Calibration(FeaC)initially reduces noise in the support image by employing feature cross-attention with the query image. Subsequently, Hierarchical Channel Adversarial Attention(HiCA)merges the background into comprehensive prototypes. We achieve this by a channel groups-based attention mechanism, where an adversarial Mean-Offset structure encourages a coarse-to-fine fusion. Extensive experiments show that previous state-of-the-art methods, when paired with Bro, experience significant performance improvements. This demonstrates a more integrated way to represent backgrounds specifically for medical image.
