Uniformly Stable Algorithms for Adversarial Training and Beyond

Jiancong Xiao; Jiawei Zhang; Zhi-Quan Luo; Asuman Ozdaglar

Uniformly Stable Algorithms for Adversarial Training and Beyond

Jiancong Xiao, Jiawei Zhang, Zhi-Quan Luo, Asuman Ozdaglar

TL;DR

This paper tackles robust overfitting in adversarial training by leveraging uniform stability as a generalization lens. It introduces Moreau envelope-$\mathcal{A}$ (ME-$\mathcal{A}$), a min-min reformulation that decouples non-smoothness from non-strong convexity, combining an inner strongly convex, non-smooth problem with an outer convex, smooth one. The authors prove uniform stability for both the inner and outer steps, and hence for the overall training, without extra computational cost, and show ME-$\mathcal{A}$ mitigates robust overfitting in practice. Beyond adversarial training, ME-$\mathcal{A}$ is presented as the first uniformly stable algorithm for weakly-convex, non-smooth problems, providing a principled path to improved generalization in challenging non-smooth settings.

Abstract

In adversarial machine learning, neural networks suffer from a significant issue known as robust overfitting, where the robust test accuracy decreases over epochs (Rice et al., 2020). Recent research conducted by Xing et al.,2021; Xiao et al., 2022 has focused on studying the uniform stability of adversarial training. Their investigations revealed that SGD-based adversarial training fails to exhibit uniform stability, and the derived stability bounds align with the observed phenomenon of robust overfitting in experiments. This motivates us to develop uniformly stable algorithms specifically tailored for adversarial training. To this aim, we introduce Moreau envelope-$\mathcal{A}$, a variant of the Moreau Envelope-type algorithm. We employ a Moreau envelope function to reframe the original problem as a min-min problem, separating the non-strong convexity and non-smoothness of the adversarial loss. Then, this approach alternates between solving the inner and outer minimization problems to achieve uniform stability without incurring additional computational overhead. In practical scenarios, we show the efficacy of ME-$\mathcal{A}$ in mitigating the issue of robust overfitting. Beyond its application in adversarial training, this represents a fundamental result in uniform stability analysis, as ME-$\mathcal{A}$ is the first algorithm to exhibit uniform stability for weakly-convex, non-smooth problems.

Uniformly Stable Algorithms for Adversarial Training and Beyond

TL;DR

This paper tackles robust overfitting in adversarial training by leveraging uniform stability as a generalization lens. It introduces Moreau envelope-

(ME-

), a min-min reformulation that decouples non-smoothness from non-strong convexity, combining an inner strongly convex, non-smooth problem with an outer convex, smooth one. The authors prove uniform stability for both the inner and outer steps, and hence for the overall training, without extra computational cost, and show ME-

mitigates robust overfitting in practice. Beyond adversarial training, ME-

is presented as the first uniformly stable algorithm for weakly-convex, non-smooth problems, providing a principled path to improved generalization in challenging non-smooth settings.

Abstract

, a variant of the Moreau Envelope-type algorithm. We employ a Moreau envelope function to reframe the original problem as a min-min problem, separating the non-strong convexity and non-smoothness of the adversarial loss. Then, this approach alternates between solving the inner and outer minimization problems to achieve uniform stability without incurring additional computational overhead. In practical scenarios, we show the efficacy of ME-

in mitigating the issue of robust overfitting. Beyond its application in adversarial training, this represents a fundamental result in uniform stability analysis, as ME-

is the first algorithm to exhibit uniform stability for weakly-convex, non-smooth problems.

Paper Structure (56 sections, 12 theorems, 67 equations, 3 figures, 7 tables, 1 algorithm)

This paper contains 56 sections, 12 theorems, 67 equations, 3 figures, 7 tables, 1 algorithm.

Introduction
Related Work
Uniform Stability.
Smooth Cases.
Proximal Methods.
Non-Smooth Cases.
Uniform Convergence Analysis.
Preliminaries of Stability Analysis
Hypothesis Class.
Adversarial Loss.
TRADES Loss.
Moreau Envelope-$\mathcal{A}$
Equivalent Problem
Uniform Stability of Inner Minimization.
Uniform Stability of Outer Minimization.
...and 41 more sections

Key Result

Theorem 3.2

Let $A$ be $\varepsilon$-uniformly stable. Then, the expected generalization gap satisfies

Figures (3)

Figure 1: Demonstration of robust overfitting (Orange Line) and mitigating robust overfitting by Moreau envelope-$\mathcal{A}$ (Red Line).
Figure 2: (a): Comparison of generalization gap induced by SGD and ME-SGD for the toy example. (b) and (c): Robust test accuracy of adversarial training using SGD and ME-$\mathcal{A}$ on SVHN, and CFAR-100, respectively.
Figure 3: Robust generalization gap and robust test accuracy in the experiments of SGD and ME-$\mathcal{A}$.

Theorems & Definitions (17)

Definition 3.1
Theorem 3.2: Generalization in expectation hardt2016train
Lemma 4.1: Equivalent Problem
Lemma 4.2: Uniform Stability of Inner Minimization
Lemma 4.3: Smoothness and Convexity of Moreau Envelops Functions
Lemma 4.4: Uniform Stability of Outer Minimization
Theorem 4.5: Generalization bound of ME-$\mathcal{A}$ in Convex Case
Definition 4.6
Theorem 4.7: Generalization bound of ME-$\mathcal{A}$ in Weakly-Convex Case
Theorem 4.8: Optimization error of ME-$\mathcal{A}$ in Convex Case
...and 7 more

Uniformly Stable Algorithms for Adversarial Training and Beyond

TL;DR

Abstract

Uniformly Stable Algorithms for Adversarial Training and Beyond

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (17)