Deep learning for automated detection of breast cancer in deep ultraviolet fluorescence images with diffusion probabilistic model
Sepehr Salem Ghahfarokhi, Tyrell To, Julie Jorns, Tina Yen, Bing Yu, Dong Hye Ye
TL;DR
This work tackles data scarcity in intraoperative breast cancer margin assessment using deep ultraviolet fluorescence images by employing a diffusion probabilistic model (DPM) to synthesize labeled patches for augmentation. A patch-based pipeline extracts ResNet50 features, classifies with XGBoost, and fuses patch decisions with Grad-CAM++–derived regional maps to yield whole-surface predictions. DPM augmentation substantially improves performance, achieving 97% accuracy with 97% sensitivity and 93% specificity, outperforming affine augmentation and ProGAN baselines. The study demonstrates that diffusion-based data augmentation can enhance robustness and generalization in breast cancer detection from scarce medical imaging data, with practical implications for reducing margin misclassification during breast-conserving surgery.
Abstract
Data limitation is a significant challenge in applying deep learning to medical images. Recently, the diffusion probabilistic model (DPM) has shown the potential to generate high-quality images by converting Gaussian random noise into realistic images. In this paper, we apply the DPM to augment the deep ultraviolet fluorescence (DUV) image dataset with an aim to improve breast cancer classification for intraoperative margin assessment. For classification, we divide the whole surface DUV image into small patches and extract convolutional features for each patch by utilizing the pre-trained ResNet. Then, we feed them into an XGBoost classifier for patch-level decisions and then fuse them with a regional importance map computed by Grad-CAM++ for whole surface-level prediction. Our experimental results show that augmenting the training dataset with the DPM significantly improves breast cancer detection performance in DUV images, increasing accuracy from 93% to 97%, compared to using Affine transformations and ProGAN.
