Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation
Yixin Chen, Lin Gao, Yajuan Gao, Rui Wang, Jingge Lian, Xiangxi Meng, Yanhua Duan, Leiying Chai, Hongbin Han, Zhaoping Cheng, Zhaoheng Xie
TL;DR
The paper tackles the challenge of designing a universal segmentation model that operates across multiple medical-imaging modalities. It introduces the Modality Projection Universal Model (MPUM), which uses a modality projection controller to map a shared latent tissue representation into modality-specific feature spaces and dynamically generate convolutional kernels, enabling robust brain and whole-body segmentation without task-specific fine-tuning. MPUM demonstrates superior performance over state-of-the-art universal models on multi-modality segmentation, provides precise intracranial hemorrhage quantification for aided diagnosis, and enables whole-body metabolic analyses that reveal brain-body coupling in pediatric epilepsy, all while offering layer-wise saliency maps for improved interpretability. The framework integrates external pre-trained model features to stabilize latent-projection learning, enabling reliable cross-modality generalization and providing practical utility in emergency CT and epilepsy research settings. Overall, MPUM advances multi-task medical imaging by delivering accurate, interpretable segmentation across CT, MR, and PET with potential to streamline clinical workflows and support brain-body axis investigations.
Abstract
The integration of deep learning in medical imaging has shown great promise for enhancing diagnostic, therapeutic, and research outcomes. However, applying universal models across multiple modalities remains challenging due to the inherent variability in data characteristics. This study aims to introduce and evaluate a Modality Projection Universal Model (MPUM). MPUM employs a novel modality-projection strategy, which allows the model to dynamically adjust its parameters to optimize performance across different imaging modalities. The MPUM demonstrated superior accuracy in identifying anatomical structures, enabling precise quantification for improved clinical decision-making. It also identifies metabolic associations within the brain-body axis, advancing research on brain-body physiological correlations. Furthermore, MPUM's unique controller-based convolution layer enables visualization of saliency maps across all network layers, significantly enhancing the model's interpretability.
