Boosting Model Resilience via Implicit Adversarial Data Augmentation
Xiaoling Zhou, Wei Ye, Zhemg Lee, Rui Xie, Shikun Zhang
TL;DR
The paper tackles learning under data biases such as long-tail, noisy labels, and subpopulation shifts by introducing Implicit Adversarial Data Augmentation (IADA), which augments samples in the deep feature space via per-sample adversarial and anti-adversarial perturbations. A surrogate loss, $ ext{L}^{IADA}$, is derived from the infinite-augmentation limit, and a meta-learning framework, Meta-IADA, trains a perturbation network to produce per-sample perturbations $epsilon_i$ guiding classifier optimization. Key contributions include the IADA loss, a per-sample perturbation mechanism, and extensive empirical validation across LT, GLT, noisy-label, and subpopulation-shift benchmarks showing state-of-the-art performance and improved robustness and fairness. The approach promises practical impact by enabling robust generalization without explicit augmentation, adapting to diverse biases through sample-wise perturbation strategies guided by a small meta-dataset.
Abstract
Data augmentation plays a pivotal role in enhancing and diversifying training data. Nonetheless, consistently improving model performance in varied learning scenarios, especially those with inherent data biases, remains challenging. To address this, we propose to augment the deep features of samples by incorporating their adversarial and anti-adversarial perturbation distributions, enabling adaptive adjustment in the learning difficulty tailored to each sample's specific characteristics. We then theoretically reveal that our augmentation process approximates the optimization of a surrogate loss function as the number of augmented copies increases indefinitely. This insight leads us to develop a meta-learning-based framework for optimizing classifiers with this novel loss, introducing the effects of augmentation while bypassing the explicit augmentation process. We conduct extensive experiments across four common biased learning scenarios: long-tail learning, generalized long-tail learning, noisy label learning, and subpopulation shift learning. The empirical results demonstrate that our method consistently achieves state-of-the-art performance, highlighting its broad adaptability.
