MedMAP: Promoting Incomplete Multi-modal Brain Tumor Segmentation with Alignment
Tianyi Liu, Zhaorui Tan, Muyin Chen, Xi Yang, Haochuan Jiang, Kaizhu Huang
TL;DR
MedMAP introduces a latent-space alignment paradigm for missing-modality brain tumor segmentation by anchoring modality-specific features to a predefined distribution $P_{mix}$, with two anchor strategies $P^{k}_{mix}$ and $P^{*}_{mix}$. The approach is theoretically supported by a tighter ELBO when aligning each modality individually to $P_{mix}$, and empirically validated across BraTS2018/2020 backbones (KD, SLS, DA) to reduce modality gaps and improve Dice scores under missing modalities. Key findings show consistent improvements across targets (WT, TC, ET) and datasets, with adaptive anchoring ($P^{*}_{mix}$) outperforming fixed anchors and standard normal baselines. Overall, MedMAP provides a general, backbone-agnostic method to learn invariant cross-modality representations, enabling more reliable brain tumor segmentation when MRI modalities are incomplete.
Abstract
Brain tumor segmentation is often based on multiple magnetic resonance imaging (MRI). However, in clinical practice, certain modalities of MRI may be missing, which presents a more difficult scenario. To cope with this challenge, Knowledge Distillation, Domain Adaption, and Shared Latent Space have emerged as commonly promising strategies. However, recent efforts typically overlook the modality gaps and thus fail to learn important invariant feature representations across different modalities. Such drawback consequently leads to limited performance for missing modality models. To ameliorate these problems, pre-trained models are used in natural visual segmentation tasks to minimize the gaps. However, promising pre-trained models are often unavailable in medical image segmentation tasks. Along this line, in this paper, we propose a novel paradigm that aligns latent features of involved modalities to a well-defined distribution anchor as the substitution of the pre-trained model}. As a major contribution, we prove that our novel training paradigm ensures a tight evidence lower bound, thus theoretically certifying its effectiveness. Extensive experiments on different backbones validate that the proposed paradigm can enable invariant feature representations and produce models with narrowed modality gaps. Models with our alignment paradigm show their superior performance on both BraTS2018 and BraTS2020 datasets.
