Revisiting the Trade-off between Accuracy and Robustness via Weight Distribution of Filters
Xingxing Wei, Shiji Zhao, Bo li
TL;DR
This work addresses the persistent trade-off between clean accuracy and adversarial robustness in deep networks. It shows that static networks inherently struggle to reconcile two distinct weight regimes induced by standard and adversarial training, and it theoretically links this to gradient-regularization effects. To overcome this, the authors propose AW-Net, a sample-wise dynamic network with a filter-level weight-variation mechanism and a multi-head adversarial router, stabilized by MixBN and trained end-to-end via MTARD. Empirically, AW-Net achieves superior Weighted Robust Accuracy across CIFAR-10/100 and Tiny-ImageNet, beating static and several dynamic baselines and exhibiting resilience to adaptive attacks. The approach offers a practical pathway to jointly improve accuracy and robustness by adapting weights to input characteristics rather than forcing a single, compromised weight configuration.
Abstract
Adversarial attacks have been proven to be potential threats to Deep Neural Networks (DNNs), and many methods are proposed to defend against adversarial attacks. However, while enhancing the robustness, the clean accuracy will decline to a certain extent, implying a trade-off existed between the accuracy and robustness. In this paper, to meet the trade-off problem, we theoretically explore the underlying reason for the difference of the filters' weight distribution between standard-trained and robust-trained models and then argue that this is an intrinsic property for static neural networks, thus they are difficult to fundamentally improve the accuracy and adversarial robustness at the same time. Based on this analysis, we propose a sample-wise dynamic network architecture named Adversarial Weight-Varied Network (AW-Net), which focuses on dealing with clean and adversarial examples with a "divide and rule" weight strategy. The AW-Net adaptively adjusts the network's weights based on regulation signals generated by an adversarial router, which is directly influenced by the input sample. Benefiting from the dynamic network architecture, clean and adversarial examples can be processed with different network weights, which provides the potential to enhance both accuracy and adversarial robustness. A series of experiments demonstrate that our AW-Net is architecture-friendly to handle both clean and adversarial examples and can achieve better trade-off performance than state-of-the-art robust models.
