Label Augmentation for Neural Networks Robustness
Fatemeh Amerehi, Patrick Healy
TL;DR
This work tackles the challenge of out-of-distribution robustness in neural networks by addressing both common corruptions and adversarial perturbations. The authors propose Label Augmentation (LA), which enriches the training signal by concatenating the original class label with an augmentation-specific label, yielding a combined output space and enforcing invariance to class identity while differentiating augmentation effects. Empirically, LA improves clean accuracy, calibration (ECE/RMS), and robustness to FGSM and PGD attacks across multiple architectures and datasets (CIFAR-10/100 and CIFAR-10-C/CIFAR-100-C), often outperforming conventional augmentation methods and even rivaling adversarial training under certain budgets. The findings suggest LA as a simple, flexible technique with potential applicability beyond images, offering a practical path to more reliable and trustworthy models under distributional shifts.
Abstract
Out-of-distribution generalization can be categorized into two types: common perturbations arising from natural variations in the real world and adversarial perturbations that are intentionally crafted to deceive neural networks. While deep neural networks excel in accuracy under the assumption of identical distributions between training and test data, they often encounter out-of-distribution scenarios resulting in a significant decline in accuracy. Data augmentation methods can effectively enhance robustness against common corruptions, but they typically fall short in improving robustness against adversarial perturbations. In this study, we develop Label Augmentation (LA), which enhances robustness against both common and intentional perturbations and improves uncertainty estimation. Our findings indicate a Clean error rate improvement of up to 23.29% when employing LA in comparisons to the baseline. Additionally, it enhances robustness under common corruptions benchmark by up to 24.23%. When tested against FGSM and PGD attacks, improvements in adversarial robustness are noticeable, with enhancements of up to 53.18% for FGSM and 24.46% for PGD attacks.
