S&D Messenger: Exchanging Semantic and Domain Knowledge for Generic Semi-Supervised Medical Image Segmentation
Qixiang Zhang, Haonan Wang, Xiaomeng Li
TL;DR
This paper tackles the challenge of unifying three related medical image segmentation tasks—SSMIS, UMDA, and Semi-MDG—under a single semi-supervised framework. It introduces the Semantic & Domain Knowledge Messenger (S&D Messenger), which enables bidirectional knowledge transfer between the labeled (semantic knowledge) and unlabeled (domain knowledge) data flows via L2U patch-based knowledge delivery and U2L cross-attention-based Messenger Transformer blocks. By incorporating these knowledge deliveries into a naive pseudo-labeling baseline, the approach achieves state-of-the-art performance across six datasets and all three task categories, notably delivering substantial Dice gains at low labeling ratios. The method promises broad practical impact for robust, label-efficient training of Transformer-based models in multi-domain medical imaging contexts and beyond.
Abstract
Semi-supervised medical image segmentation (SSMIS) has emerged as a promising solution to tackle the challenges of time-consuming manual labeling in the medical field. However, in practical scenarios, there are often domain variations within the datasets, leading to derivative scenarios like semi-supervised medical domain generalization (Semi-MDG) and unsupervised medical domain adaptation (UMDA). In this paper, we aim to develop a generic framework that masters all three tasks. We notice a critical shared challenge across three scenarios: the explicit semantic knowledge for segmentation performance and rich domain knowledge for generalizability exclusively exist in the labeled set and unlabeled set respectively. Such discrepancy hinders existing methods from effectively comprehending both types of knowledge under semi-supervised settings. To tackle this challenge, we develop a Semantic & Domain Knowledge Messenger (S&D Messenger) which facilitates direct knowledge delivery between the labeled and unlabeled set, and thus allowing the model to comprehend both of them in each individual learning flow. Equipped with our S&D Messenger, a naive pseudo-labeling method can achieve huge improvement on six benchmark datasets for SSMIS (+7.5%), UMDA (+5.6%), and Semi-MDG tasks (+1.14%), compared with state-of-the-art methods designed for specific tasks.
