Defense without Forgetting: Continual Adversarial Defense with Anisotropic & Isotropic Pseudo Replay
Yuhang Zhou, Zhongyun Hua
TL;DR
The paper addresses continual adversarial defense where attacks arrive sequentially, risking catastrophic forgetting of previous defenses. It introduces Anisotropic & Isotropic Replay (AIR), a memory-free baseline that combines isotropic replay for neighborhood consistency, anisotropic mix-distill for richer semantics, and a regularizer to balance plasticity and stability, all within a self-distillation pseudo-replay framework. The approach yields an end-to-end loss that unifies adversarial training with pseudo-replay losses, achieving robustness across attack sequences and often approaching or exceeding Joint Training without data reuse. Empirical evaluation on MNIST, CIFAR-10, and CIFAR-100 demonstrates AIR's ability to mitigate forgetting, align feature distributions across attacks, and provide practical continual defense under varying attack budgets and sequences.
Abstract
Deep neural networks have demonstrated susceptibility to adversarial attacks. Adversarial defense techniques often focus on one-shot setting to maintain robustness against attack. However, new attacks can emerge in sequences in real-world deployment scenarios. As a result, it is crucial for a defense model to constantly adapt to new attacks, but the adaptation process can lead to catastrophic forgetting of previously defended against attacks. In this paper, we discuss for the first time the concept of continual adversarial defense under a sequence of attacks, and propose a lifelong defense baseline called Anisotropic \& Isotropic Replay (AIR), which offers three advantages: (1) Isotropic replay ensures model consistency in the neighborhood distribution of new data, indirectly aligning the output preference between old and new tasks. (2) Anisotropic replay enables the model to learn a compromise data manifold with fresh mixed semantics for further replay constraints and potential future attacks. (3) A straightforward regularizer mitigates the 'plasticity-stability' trade-off by aligning model output between new and old tasks. Experiment results demonstrate that AIR can approximate or even exceed the empirical performance upper bounds achieved by Joint Training.
