A data-centric approach to class-specific bias in image data augmentation
Athanasios Angelakis, Andrey Rass
TL;DR
This paper investigates how data augmentation can induce class-specific bias that depends on dataset characteristics and model architecture. It evaluates Random Crop and Random Horizontal Flip across Fashion-MNIST, CIFAR-10, and CIFAR-100 using ResNet50, EfficientNetV2S, and SWIN Transformer, and introduces a Data Augmentation Robustness Scouting protocol that probes augmentation intensity $\alpha$ to quantify per-class and overall performance dynamics. The results reveal dataset- and architecture-dependent bias, with Vision Transformers showing delayed or altered bias dynamics relative to residual CNNs, and demonstrate a substantial reduction in computational cost (training 112 models vs 1860) while preserving bias-trend capture. These insights guide practical bias mitigation and model selection for deployments with aggressive data augmentation, and point to future work expanding architectures and datasets to further understand DA-induced biases.
Abstract
Data augmentation (DA) enhances model generalization in computer vision but may introduce biases, impacting class accuracy unevenly. Our study extends this inquiry, examining DA's class-specific bias across various datasets, including those distinct from ImageNet, through random cropping. We evaluated this phenomenon with ResNet50, EfficientNetV2S, and SWIN ViT, discovering that while residual models showed similar bias effects, Vision Transformers exhibited greater robustness or altered dynamics. This suggests a nuanced approach to model selection, emphasizing bias mitigation. We also refined a "data augmentation robustness scouting" method to manage DA-induced biases more efficiently, reducing computational demands significantly (training 112 models instead of 1860; a reduction of factor 16.2) while still capturing essential bias trends.
