Adversarially Robust Feature Learning for Breast Cancer Diagnosis
Degan Hao, Dooman Arefan, Margarita Zuley, Wendie Berg, Shandong Wu
TL;DR
The paper tackles adversarial vulnerability in breast cancer diagnosis by proposing Adversarially RobuSt Feature Learning (ARFL), a regularization that promotes strong feature-label correlations through a robust loss integrated into a minimax training framework over mixed standard and adversarial data. The robust loss, defined as $L_{robust}(\theta, x, y) = -(1/(HW)) * \sum_{i=1}^H \sum_{j=1}^W sigma(|f_{i,j}(\theta, x) * y|)$, guides the model to emphasize high-quality imaging features; the overall objective combines this with the standard cross-entropy loss via a weight $\lambda$, and training mixes data via a ratio $r$. Across synthetic and two real mammography datasets (Institution A and CMMD), dual adversarial training with ARFL achieves superior robustness (higher adversarial and mean AUC/accuracy) compared to DSBN, TRADES, and MIRST, while maintaining or improving performance on clean data. The results demonstrate ARFL's ability to learn discriminative, robust imaging features and provide practical guidance on mixing standard and adversarial data for safer clinical deployment.
Abstract
Adversarial data can lead to malfunction of deep learning applications. It is essential to develop deep learning models that are robust to adversarial data while accurate on standard, clean data. In this study, we proposed a novel adversarially robust feature learning (ARFL) method for a real-world application of breast cancer diagnosis. ARFL facilitates adversarial training using both standard data and adversarial data, where a feature correlation measure is incorporated as an objective function to encourage learning of robust features and restrain spurious features. To show the effects of ARFL in breast cancer diagnosis, we built and evaluated diagnosis models using two independent clinically collected breast imaging datasets, comprising a total of 9,548 mammogram images. We performed extensive experiments showing that our method outperformed several state-of-the-art methods and that our method can enhance safer breast cancer diagnosis against adversarial attacks in clinical settings.
