CCSD: Cross-Modal Compositional Self-Distillation for Robust Brain Tumor Segmentation with Missing Modalities
Dongqing Xie, Yonghuang Wu, Zisheng Ai, Jun Min, Zhencun Jiang, Shaojin Geng, Lei Wang
TL;DR
CCSD addresses brain tumor segmentation from multi-modal MRI under missing modalities by introducing a shared-specific encoder-decoder with two self-distillation strategies. HMSD transfers knowledge across modality hierarchies from full to partial modality sets, while DMCD simulates progressive modality dropout along a criticality-informed path to boost robustness. Evaluations on BraTS benchmarks show state-of-the-art performance and strong stability across diverse missing-modality scenarios, with ablations confirming the contribution of each distillation component. This approach offers a practical, scalable solution for clinical settings where complete multi-modal data are often unavailable, and it can extend to other multi-modal medical imaging tasks with minimal architectural changes.
Abstract
The accurate segmentation of brain tumors from multi-modal MRI is critical for clinical diagnosis and treatment planning. While integrating complementary information from various MRI sequences is a common practice, the frequent absence of one or more modalities in real-world clinical settings poses a significant challenge, severely compromising the performance and generalizability of deep learning-based segmentation models. To address this challenge, we propose a novel Cross-Modal Compositional Self-Distillation (CCSD) framework that can flexibly handle arbitrary combinations of input modalities. CCSD adopts a shared-specific encoder-decoder architecture and incorporates two self-distillation strategies: (i) a hierarchical modality self-distillation mechanism that transfers knowledge across modality hierarchies to reduce semantic discrepancies, and (ii) a progressive modality combination distillation approach that enhances robustness to missing modalities by simulating gradual modality dropout during training. Extensive experiments on public brain tumor segmentation benchmarks demonstrate that CCSD achieves state-of-the-art performance across various missing-modality scenarios, with strong generalization and stability.
