Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis
Srishti Gupta, Zhang Chen, Luca Demetrio, Xiaoyi Feng, Zhaoqiang Xia, Antonio Emanuele Cinà, Maura Pintor, Luca Oneto, Ambra Demontis, Battista Biggio, Fabio Roli
TL;DR
The paper investigates how over-parameterization affects adversarial robustness, addressing conflicting claims in prior work by incorporating reliability checks for attack effectiveness. By combining PGD with AutoAttack and IoAF indicators, the authors demonstrate that larger networks not only generalize better but also exhibit greater robustness to adversarial perturbations than smaller ones. The experiments on MNIST and CIFAR-10 across CNN, FC-ReLU, and ResNet architectures show that base performance remains strong or improves with size, and robustness gains are consistent across evaluation methods. The study argues that previous inconsistencies stem from unreliable attack evaluations and advocates reliability as a standard component of robustness analyses, with implications for theory and future secure ML research.
Abstract
Thanks to their extensive capacity, over-parameterized neural networks exhibit superior predictive capabilities and generalization. However, having a large parameter space is considered one of the main suspects of the neural networks' vulnerability to adversarial example -- input samples crafted ad-hoc to induce a desired misclassification. Relevant literature has claimed contradictory remarks in support of and against the robustness of over-parameterized networks. These contradictory findings might be due to the failure of the attack employed to evaluate the networks' robustness. Previous research has demonstrated that depending on the considered model, the algorithm employed to generate adversarial examples may not function properly, leading to overestimating the model's robustness. In this work, we empirically study the robustness of over-parameterized networks against adversarial examples. However, unlike the previous works, we also evaluate the considered attack's reliability to support the results' veracity. Our results show that over-parameterized networks are robust against adversarial attacks as opposed to their under-parameterized counterparts.
