Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Zihui Wu, Haichang Gao, Bingqian Zhou, Xiaoyan Guo, Shudong Zhang
TL;DR
The paper tackles adversarial robustness in adversarial training (AT) by identifying optimization difficulty as a key bottleneck and introducing a Bregman-divergence perspective that links AT losses to $KL$-divergence and entropy-based geometry. It shows that TRADES, by separating accuracy and robustness objectives, is easier to optimize than PGD-AT, and proposes two methods, FAIT and MER, to further ease optimization while boosting robustness under $10$-step PGD and AutoAttack. FAIT introduces an interpolated PGD path to decouple robustness losses; MER maximizes output entropy to make robustness learning easier. Experiments across CIFAR-10/100 with multiple architectures and alternative distances demonstrate improved robustness, scalability, and generality, providing concrete design guidelines for robust AT methods.
Abstract
In this paper, we investigate on improving the adversarial robustness obtained in adversarial training (AT) via reducing the difficulty of optimization. To better study this problem, we build a novel Bregman divergence perspective for AT, in which AT can be viewed as the sliding process of the training data points on the negative entropy curve. Based on this perspective, we analyze the learning objectives of two typical AT methods, i.e., PGD-AT and TRADES, and we find that the optimization process of TRADES is easier than PGD-AT for that TRADES separates PGD-AT. In addition, we discuss the function of entropy in TRADES, and we find that models with high entropy can be better robustness learners. Inspired by the above findings, we propose two methods, i.e., FAIT and MER, which can both not only reduce the difficulty of optimization under the 10-step PGD adversaries, but also provide better robustness. Our work suggests that reducing the difficulty of optimization under the 10-step PGD adversaries is a promising approach for enhancing the adversarial robustness in AT.
