MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation
Yuzheng Wang, Zhaoyu Chen, Dingkang Yang, Yuanhang Wang, Lizhe Qi
TL;DR
The paper tackles robustness saturation and teacher-dependence in Adversarial Robustness Distillation by reframing the min-max ARD optimization. It introduces MMARD, which enriches the inner maximization with the teacher’s robust predictions to generate more informative adversarial examples and adds a triangular relationship in the outer minimization to better capture invariant mappings between natural and robust scenarios. Empirical results on CIFAR-10/100 with WideResNet teachers show state-of-the-art robust accuracy under AutoAttack and improved resilience to teacher choice, while maintaining competitive clean accuracy. The approach is plug-and-play and compatible with existing ARD methods, offering a practical route to stronger small-model robustness with less hyperparameter sensitivity.
Abstract
Adversarial Robustness Distillation (ARD) is a promising task to boost the robustness of small-capacity models with the guidance of the pre-trained robust teacher. The ARD can be summarized as a min-max optimization process, i.e., synthesizing adversarial examples (inner) & training the student (outer). Although competitive robustness performance, existing ARD methods still have issues. In the inner process, the synthetic training examples are far from the teacher's decision boundary leading to important robust information missing. In the outer process, the student model is decoupled from learning natural and robust scenarios, leading to the robustness saturation, i.e., student performance is highly susceptible to customized teacher selection. To tackle these issues, this paper proposes a general Min-Max optimization Adversarial Robustness Distillation (MMARD) method. For the inner process, we introduce the teacher's robust predictions, which drive the training examples closer to the teacher's decision boundary to explore more robust knowledge. For the outer process, we propose a structured information modeling method based on triangular relationships to measure the mutual information of the model in natural and robust scenarios and enhance the model's ability to understand multi-scenario mapping relationships. Experiments show our MMARD achieves state-of-the-art performance on multiple benchmarks. Besides, MMARD is plug-and-play and convenient to combine with existing methods.
