GBT-SAM: A Parameter-Efficient Depth-Aware Model for Generalizable Brain tumour Segmentation on mp-MRI
Cecilia Diana-Albelda, Roberto Alcover-Couso, Álvaro García-Martín, Jesus Bescos, Marcos Escudero-Viñolo
TL;DR
This work tackles automatic brain tumor segmentation on multi-parametric MRI by adapting the Segment Anything Model (SAM) to volumetric data in a parameter-efficient way. The proposed GBT-SAM uses a four-channel patch embedding to fuse T1, T2, T1c, and T2-FLAIR, a two-stage training regime, LoRA-based PEFT, and a Depth-Condition module to capture inter-slice correlations, achieving 6.5M trainable parameters. It delivers a Dice score of 93.54 on BraTS Adult Glioma and demonstrates strong cross-domain generalization to Meningioma, Pediatric Glioma, and Sub-Saharan Glioma, highlighting practical efficiency and robustness. The approach offers a promising, scalable solution for clinical workflows with reduced computational cost while maintaining high segmentation accuracy.
Abstract
Gliomas are aggressive brain tumors that require accurate imaging-based diagnosis, with segmentation playing a critical role in evaluating morphology and treatment decisions. Manual delineation of gliomas is time-consuming and prone to variability, motivating the use of deep learning to improve consistency and alleviate clinical workload. However, existing methods often fail to fully exploit the information available in multi-parametric MRI (mp-MRI), particularly inter-slice contextual features, and typically require considerable computational resources while lacking robustness across tumor type variations. We present GBT-SAM, a parameter-efficient deep learning framework that adapts the Segment Anything Model (SAM), a large-scale vision model, to volumetric mp-MRI data. GBT-SAM reduces input complexity by selecting fewer than 2.6\% of slices per scan while incorporating all four MRI modalities, preserving essential tumor-related information with minimal cost. Furthermore, our model is trained by a two-step fine-tuning strategy that incorporates a depth-aware module to capture inter-slice correlations and lightweight adaptation layers, resulting in just 6.5M trainable parameters, which is the lowest among SAM-based approaches. GBT-SAM achieves a Dice Score of 93.54 on the BraTS Adult Glioma dataset and demonstrates robust performance on Meningioma, Pediatric Glioma, and Sub-Saharan Glioma datasets. These results highlight GBT-SAM's potential as a computationally efficient and domain-robust framework for brain tumor segmentation using mp-MRI. Our code and models are available at https://github.com/vpulab/med-sam-brain .
