BeSt-LeS: Benchmarking Stroke Lesion Segmentation using Deep Supervision
Prantik Deb, Lalith Bharadwaj Baru, Kamalaker Dadi, Bapi Raju S
TL;DR
This work benchmarks end-to-end U-Net–style models for stroke lesion segmentation on ATLAS v2.0 across 2D and 3D T1-weighted MRI. It evaluates five 2D architectures (U-Net, Residual U-Net, Attention U-Net, TransAttn U-Net, and U-Net Transformer) and three 3D variants, reporting the top Dice scores of $0.583$ for the 2D Transformer-based model and $0.504$ for the 3D Residual U-Net. A Wilcoxon Signed Rank Test on predicted versus true lesion volumes shows significant agreement for the 3D U-Net ($p\approx 0.0$, $\rho=0.949$) and 3D Residual U-Net ($p=0.001$, $\rho=0.962$), but not for Attention U-Net ($p=0.540$, $\rho=0.844$). The authors provide reproducible code at GitHub and discuss limitations such as the absence of data augmentation and alternative supervision strategies, recommending future work in augmentation, multi-modality data, and cascaded attention to advance stroke lesion segmentation.
Abstract
Brain stroke has become a significant burden on global health and thus we need remedies and prevention strategies to overcome this challenge. For this, the immediate identification of stroke and risk stratification is the primary task for clinicians. To aid expert clinicians, automated segmentation models are crucial. In this work, we consider the publicly available dataset ATLAS $v2.0$ to benchmark various end-to-end supervised U-Net style models. Specifically, we have benchmarked models on both 2D and 3D brain images and evaluated them using standard metrics. We have achieved the highest Dice score of 0.583 on the 2D transformer-based model and 0.504 on the 3D residual U-Net respectively. We have conducted the Wilcoxon test for 3D models to correlate the relationship between predicted and actual stroke volume. For reproducibility, the code and model weights are made publicly available: https://github.com/prantik-pdeb/BeSt-LeS.
