Model Ensemble for Brain Tumor Segmentation in Magnetic Resonance Imaging
Daniel Capellán-Martín, Zhifan Jiang, Abhijeet Parida, Xinyang Liu, Van Lam, Hareem Nisar, Austin Tapp, Sarah Elsharkawi, Maria J. Ledesma-Carbayo, Syed Muhammad Anwar, Marius George Linguraru
TL;DR
This work tackles brain tumor segmentation in BraTS 2023's newly added PED, MEN, and MET tasks with limited data. It introduces a region-wise ensemble of two state-of-the-art models, nnU-Net and Swin UNETR, combined with a cross-validated threshold-based post-processing pipeline to optimize lesion-wise performance. The approach yields improved lesion-wise Dice scores and ranks highly on unseen test data (PED 1st, MEN 3rd, MET 4th), underscoring the value of model ensembling and task-specific post-processing for rare tumors. The study also discusses limitations due to data scarcity and proposes future directions, including self-supervised pretraining and synthetic data augmentation, to boost generalizability across clinical settings.
Abstract
Segmenting brain tumors in multi-parametric magnetic resonance imaging enables performing quantitative analysis in support of clinical trials and personalized patient care. This analysis provides the potential to impact clinical decision-making processes, including diagnosis and prognosis. In 2023, the well-established Brain Tumor Segmentation (BraTS) challenge presented a substantial expansion with eight tasks and 4,500 brain tumor cases. In this paper, we present a deep learning-based ensemble strategy that is evaluated for newly included tumor cases in three tasks: pediatric brain tumors (PED), intracranial meningioma (MEN), and brain metastases (MET). In particular, we ensemble outputs from state-of-the-art nnU-Net and Swin UNETR models on a region-wise basis. Furthermore, we implemented a targeted post-processing strategy based on a cross-validated threshold search to improve the segmentation results for tumor sub-regions. The evaluation of our proposed method on unseen test cases for the three tasks resulted in lesion-wise Dice scores for PED: 0.653, 0.809, 0.826; MEN: 0.876, 0.867, 0.849; and MET: 0.555, 0.6, 0.58; for the enhancing tumor, tumor core, and whole tumor, respectively. Our method was ranked first for PED, third for MEN, and fourth for MET, respectively.
