Minimal Cascade Gradient Smoothing for Fast Transferable Preemptive Adversarial Defense
Hanrui Wang, Ching-Chun Chang, Chun-Shien Lu, Ching-Chia Kao, Isao Echizen
TL;DR
MSPD presents a fast, transferable preemptive defense that does not require access to target models or gradients. It introduces Minimal Cascade Gradient Smoothing (MCGS), a two-epoch cascade with forward and backward learning plus gradient smoothing trained on a surrogate backbone and decoupled classifier to achieve strong clean and robust performance across unseen models and attacks. An adaptive diagnostic, Preemptive Reversion, tests resilience to white-box reversals, showing MSPD remains robust except under unrealistically full gradient access, while standard purifiers fail to neutralize the defense. Extensive experiments on CIFAR-10 and ImageNet demonstrate state-of-the-art robustness, significantly higher speed, and favorable visual quality compared to prior preemptive defenses, with formal proofs supporting MSPD, MCGS, and Preemptive Reversion. The work highlights practical, user-side deployment of content protection that generalizes to unknown threats, offering a scalable solution for safeguarding media in real-world workflows.
Abstract
Adversarial attacks persist as a major challenge in deep learning. While training- and test-time defenses are well-studied, they often reduce clean accuracy, incur high cost, or fail under adaptive threats. In contrast, preemptive defenses, which perturb media before release, offer a practical alternative but remain slow, model-coupled, and brittle. We propose the Minimal Sufficient Preemptive Defense (MSPD), a fast, transferable framework that defends against future attacks without access to the target model or gradients. MSPD is driven by Minimal Cascade Gradient Smoothing (MCGS), a two-epoch optimization paradigm executed on a surrogate backbone. This defines a minimal yet effective regime for robust generalization across unseen models and attacks. MSPD runs at 0.02s/image (CIFAR-10) and 0.26s/image (ImageNet), 28--1696x faster than prior preemptive methods, while improving robust accuracy by +5% and clean accuracy by +3.7% across 11 models and 7 attacks. To evaluate adaptive robustness, we introduce Preemptive Reversion, the first white-box diagnostic attack that cancels preemptive perturbations under full gradient access. Even in this setting, MSPD retains a +2.2% robustness margin over the baseline. In practice, when gradients are unavailable, MSPD remains reliable and efficient. MSPD, MCGS, and Preemptive Reversion are each supported by formal theoretical proofs. The implementation is available at https://github.com/azrealwang/MSPD.
