Position: Towards Resilience Against Adversarial Examples

Sihui Dai; Chong Xiang; Tong Wu; Prateek Mittal

Position: Towards Resilience Against Adversarial Examples

Sihui Dai, Chong Xiang, Tong Wu, Prateek Mittal

TL;DR

The paper argues that robustness against a single or known set of attacks is insufficient for real-world deployments, as attackers can uncover novel perturbations post-deployment. It introduces adversarial resilience and a simplified continual adaptive robustness (CAR) framework wherein defenses can update quickly in response to newly discovered attacks, guided by time-evolving attacker and defender knowledge sets. CAR bridges to existing concepts like simultaneous multiattack robustness (sMAR) and unforeseen attack robustness (UAR), indicating that advances in those areas can directly enhance resilience. The work also outlines practical applications, challenges such as label scarcity and poisoning risks, and open directions for research, including standardized evaluation and efficient fine-tuning strategies for rapid adaptation. Overall, the authors advocate for a shift from static robustness to adaptive resilience to ensure sustained protection against evolving adversarial threats.

Abstract

Current research on defending against adversarial examples focuses primarily on achieving robustness against a single attack type such as $\ell_2$ or $\ell_{\infty}$-bounded attacks. However, the space of possible perturbations is much larger than considered by many existing defenses and is difficult to mathematically model, so the attacker can easily bypass the defense by using a type of attack that is not covered by the defense. In this position paper, we argue that in addition to robustness, we should also aim to develop defense algorithms that are adversarially resilient -- defense algorithms should specify a means to quickly adapt the defended model to be robust against new attacks. We provide a definition of adversarial resilience and outline considerations of designing an adversarially resilient defense. We then introduce a subproblem of adversarial resilience which we call continual adaptive robustness, in which the defender gains knowledge of the formulation of possible perturbation spaces over time and can then update their model based on this information. Additionally, we demonstrate the connection between continual adaptive robustness and previously studied problems of multiattack robustness and unforeseen attack robustness and outline open directions within these fields which can contribute to improving continual adaptive robustness and adversarial resilience.

Position: Towards Resilience Against Adversarial Examples

TL;DR

Abstract

Current research on defending against adversarial examples focuses primarily on achieving robustness against a single attack type such as

-bounded attacks. However, the space of possible perturbations is much larger than considered by many existing defenses and is difficult to mathematically model, so the attacker can easily bypass the defense by using a type of attack that is not covered by the defense. In this position paper, we argue that in addition to robustness, we should also aim to develop defense algorithms that are adversarially resilient -- defense algorithms should specify a means to quickly adapt the defended model to be robust against new attacks. We provide a definition of adversarial resilience and outline considerations of designing an adversarially resilient defense. We then introduce a subproblem of adversarial resilience which we call continual adaptive robustness, in which the defender gains knowledge of the formulation of possible perturbation spaces over time and can then update their model based on this information. Additionally, we demonstrate the connection between continual adaptive robustness and previously studied problems of multiattack robustness and unforeseen attack robustness and outline open directions within these fields which can contribute to improving continual adaptive robustness and adversarial resilience.

Paper Structure (17 sections, 4 equations, 1 figure, 1 table)

This paper contains 17 sections, 4 equations, 1 figure, 1 table.

Introduction
Background: Adversarial Robustness
A Game Formulation of Single Attack Robustness
Attacks in Adversarial ML Research
Defenses in Adversarial ML Research
Towards Resilience Against Adversarial Examples
Applications for which Resilience is Important
Challenges of an Adversarially Resilient Defense and Open Directions
Continual Adaptive Robustness: A Simplified Setting of Adversarial Resilience
Open Directions for Research
Connection to Simultaneous Multiattack and Unforeseen Attack Robustness
Conclusion
Instantiating the Single Attack Game
Prior Work in Simultaneous Multiattack Robustness
Open Directions for Research in Simultaneous Multiattack Robustness
...and 2 more sections

Figures (1)

Figure 2: Full knowledge vs knowledge mismatch. The white box represents the space of possible perturbations that we would expect a model to be robust to (ie. space of imperceptible perturbations) which we may not know how to model. The green oval represents the space of perturbations captured in the defender knowledge set and the red oval represents the space of perturbations captured in the attacker knowledge set. (A) Full knowledge occurs when the defender knows the space known to the attacker while (B) Knowledge mismatch occurs when there exist perturbations known to the attacker that are not known to the defender. Robustness in this setting corresponds to unforeseen attack robustness.

Theorems & Definitions (6)

Definition 2.1: Single Attack Game
Definition 3.1: Adaptive Attacker Knowledge Set
Definition 3.2: Adaptive Defense Algorithm
Definition 3.3: Adversarial Resilience
Definition 6.1: Adaptive Defender Knowledge Set
Definition 6.2: Continual Adaptive Robustness

Position: Towards Resilience Against Adversarial Examples

TL;DR

Abstract

Position: Towards Resilience Against Adversarial Examples

Authors

TL;DR

Abstract

Table of Contents

Figures (1)

Theorems & Definitions (6)