Analyzing Data Augmentation for Medical Images: A Case Study in Ultrasound Images
Adam Tupper, Christian Gagné
TL;DR
Medical-imaging data scarcity motivates systematic evaluation of augmentation strategies. The authors compare individual, paired, and random-sampling augmentations on breast lesion classification in ultrasound, using $5 \times 2$ cross-validation across BUSI and BUS-BRA datasets with a ResNet-18 baseline. They find substantial variability in the effectiveness of individual transforms, but randomly sampling from a diverse augmentation pool with TrivialAugment yields consistent, sometimes dramatic, gains up to about $10.4\%$ in balanced accuracy. The work provides practical guidance on augmentation policy, showing that random augmentation pools outperform fixed sequences and offering a blueprint for rigorous augmentation studies across modalities. Overall, it advances standardization of data augmentation in breast ultrasonography and informs augmentation strategies for broader medical-imaging tasks.
Abstract
Data augmentation is one of the most effective techniques to improve the generalization performance of deep neural networks. Yet, despite often facing limited data availability in medical image analysis, it is frequently underutilized. This appears to be due to a gap in our collective understanding of the efficacy of different augmentation techniques across medical imaging tasks and modalities. One domain where this is especially true is breast ultrasound images. This work addresses this issue by analyzing the effectiveness of different augmentation techniques for the classification of breast lesions in ultrasound images. We assess the generalizability of our findings across several datasets, demonstrate that certain augmentations are far more effective than others, and show that their usage leads to significant performance gains.
