Attention-gated U-Net model for semantic segmentation of brain tumors and feature extraction for survival prognosis
Rut Pate, Snehal Rajput, Mehul S. Raval, Rupal A. Kapdi, Mohendra Roy
TL;DR
This study introduces a tri-planar Attention-Gated Recurrent Residual U-Net (R2U-Net) designed for efficient brain tumor segmentation in MRI and for survival-day (SD) prognosis. Three parallel 2D networks process sagittal, coronal, and axial planes, with their encoder features fused to predict SD through a 3-layer ANN, while segmentation outputs are combined by averaging plane-wise probability maps. The model achieves competitive BraTS2021 validation Dice scores (e.g., WT ≈ 0.900, TC ≈ 0.824, ET ≈ 0.775) and strong specificity, while SD prediction on BraTS2020 yields 45.71% accuracy, MSE ≈ 1.08×10^5, and SpearmanR ≈ 0.338, highlighting both potential and areas for improvement. The approach emphasizes efficiency and interpretability by leveraging multi-plane features and encoder bottlenecks to support clinical decision-making, with future directions including lighter architectures and integration of radiomic and multi-omics data to enhance prognosis accuracy.
Abstract
Gliomas, among the most common primary brain tumors, vary widely in aggressiveness, prognosis, and histology, making treatment challenging due to complex and time-intensive surgical interventions. This study presents an Attention-Gated Recurrent Residual U-Net (R2U-Net) based Triplanar (2.5D) model for improved brain tumor segmentation. The proposed model enhances feature representation and segmentation accuracy by integrating residual, recurrent, and triplanar architectures while maintaining computational efficiency, potentially aiding in better treatment planning. The proposed method achieves a Dice Similarity Score (DSC) of 0.900 for Whole Tumor (WT) segmentation on the BraTS2021 validation set, demonstrating performance comparable to leading models. Additionally, the triplanar network extracts 64 features per planar model for survival days prediction, which are reduced to 28 using an Artificial Neural Network (ANN). This approach achieves an accuracy of 45.71%, a Mean Squared Error (MSE) of 108,318.128, and a Spearman Rank Correlation Coefficient (SRC) of 0.338 on the test dataset.
