Transformer Classification of Breast Lesions: The BreastDCEDL_AMBL Benchmark Dataset and 0.92 AUC Baseline
Naomi Fridman, Anat Goldstein
TL;DR
This work tackles the specificity gap in breast DCE-MRI by developing a transformer-based lesion classifier using RGB-fused temporal MRI patches. Leveraging SegFormer and a large-scale, publicly available training corpus augmented with ISPY data, the authors achieve a lesion-level AUC of $0.92$ and 100% patient-level sensitivity, while providing interpretable malignant segmentation maps. A publicly released benchmark, BreastDCEDL_AMBL, together with trained models and evaluation protocols, offers a reproducible platform to benchmark future methods and accelerate clinical translation. The study also delineates clear directions for advancing toward end-to-end detection, volumetric processing, and privacy-preserving multi-institution learning.
Abstract
Breast magnetic resonance imaging is a critical tool for cancer detection and treatment planning, but its clinical utility is hindered by poor specificity, leading to high false-positive rates and unnecessary biopsies. This study introduces a transformer-based framework for automated classification of breast lesions in dynamic contrast-enhanced MRI, addressing the challenge of distinguishing benign from malignant findings. We implemented a SegFormer architecture that achieved an AUC of 0.92 for lesion-level classification, with 100% sensitivity and 67% specificity at the patient level - potentially eliminating one-third of unnecessary biopsies without missing malignancies. The model quantifies malignant pixel distribution via semantic segmentation, producing interpretable spatial predictions that support clinical decision-making. To establish reproducible benchmarks, we curated BreastDCEDL_AMBL by transforming The Cancer Imaging Archive's AMBL collection into a standardized deep learning dataset with 88 patients and 133 annotated lesions (89 benign, 44 malignant). This resource addresses a key infrastructure gap, as existing public datasets lack benign lesion annotations, limiting benign-malignant classification research. Training incorporated an expanded cohort of over 1,200 patients through integration with BreastDCEDL datasets, validating transfer learning approaches despite primary tumor-only annotations. Public release of the dataset, models, and evaluation protocols provides the first standardized benchmark for DCE-MRI lesion classification, enabling methodological advancement toward clinical deployment.
