Enhanced MRI Representation via Cross-series Masking
Churan Wang, Fei Gao, Lijun Yan, Siwen Wang, Yizhou Yu, Yizhou Wang
TL;DR
This work tackles the challenge of learning robust representations from multi-series MRI data in the absence of extensive annotations. It introduces Cross-Series Masking (CSM), a self-supervised strategy that employs intra-series masking and inter-series masking to train a ViT-based encoder via reconstruction, enabling fusion of information across series. The learned representations achieve state-of-the-art results on brain tumor segmentation and improve breast MRI and prostate cancer diagnosis, even with limited labeled data, highlighting strong potential for clinical deployment. By leveraging large unlabeled multi-series datasets, CSM reduces annotation costs while delivering performance gains across diverse downstream tasks.
Abstract
Magnetic resonance imaging (MRI) is indispensable for diagnosing and planning treatment in various medical conditions due to its ability to produce multi-series images that reveal different tissue characteristics. However, integrating these diverse series to form a coherent analysis presents significant challenges, such as differing spatial resolutions and contrast patterns meanwhile requiring extensive annotated data, which is scarce in clinical practice. Due to these issues, we introduce a novel Cross-Series Masking (CSM) Strategy for effectively learning MRI representation in a self-supervised manner. Specifically, CSM commences by randomly sampling a subset of regions and series, which are then strategically masked. In the training process, the cross-series representation is learned by utilizing the unmasked data to reconstruct the masked portions. This process not only integrates information across different series but also facilitates the ability to model both intra-series and inter-series correlations and complementarities. With the learned representation, the downstream tasks like segmentation and classification are also enhanced. Taking brain tissue segmentation, breast tumor benign/malignant classification, and prostate cancer diagnosis as examples, our method achieves state-of-the-art performance on both public and in-house datasets.
