Robust Principles: Architectural Design Principles for Adversarially Robust CNNs
ShengYun Peng, Weilin Xu, Cory Cornelius, Matthew Hull, Kevin Li, Rahul Duggal, Mansi Phute, Jason Martin, Duen Horng Chau
TL;DR
This work tackles the inconsistency in how CNN architectural choices affect adversarial robustness by proposing three generalizable principles: (A) optimal depth–width ranges, (B) convolutional stem over patch-based downsampling, and (C) robust residual blocks with squeeze-and-excitation and non-parametric smooth activations. Through extensive experiments on CIFAR-10, CIFAR-100, and ImageNet across multiple training recipes and model families, the authors show consistent robustness gains, culminating in ra architectures that outperform strong baselines including Transformers and NAS-based networks under adversarial evaluation. The findings demonstrate that principled architectural design can substantially enhance robustness, not just training tricks, and provide practical guidelines for building more resilient CNNs. The work is supported by public code and broad empirical validation across dataset scales and design spaces.
Abstract
Our research aims to unify existing works' diverging opinions on how architectural components affect the adversarial robustness of CNNs. To accomplish our goal, we synthesize a suite of three generalizable robust architectural design principles: (a) optimal range for depth and width configurations, (b) preferring convolutional over patchify stem stage, and (c) robust residual block design through adopting squeeze and excitation blocks and non-parametric smooth activation functions. Through extensive experiments across a wide spectrum of dataset scales, adversarial training methods, model parameters, and network design spaces, our principles consistently and markedly improve AutoAttack accuracy: 1-3 percentage points (pp) on CIFAR-10 and CIFAR-100, and 4-9 pp on ImageNet. The code is publicly available at https://github.com/poloclub/robust-principles.
