Table of Contents
Fetching ...

Adversarially robust generalization theory via Jacobian regularization for deep neural networks

Dongya Wu, Xin Li

TL;DR

This work theoretically ties Jacobian regularization to adversarial robustness by showing that Jacobian-based surrogate losses $\hat{\ell}_{2}$ and $\hat{\ell}_{\infty}$ upper-bound the first-order robust loss under $\ell_{2}$ and $\ell_{\infty}$ perturbations. It derives robust generalization bounds via Rademacher complexity and covering-number techniques that apply to vector-valued models, and introduces two Jacobian regularizations, $\|\nabla_{x}f\|_{F}^{2}$ and $\|\nabla_{x}f\|_{1,1}$, to control robustness while remaining computationally viable. The paper proves that reducing Jacobian norms tightens the robust generalization gap and demonstrates through MNIST experiments that Jacobian-regularized risk minimization can serve as an effective surrogate for adversarially robust risk minimization, improving both standard and robust generalization. These results bridge theory and practice, offering a principled path to robust generalization via Jacobian regulation in deep networks.

Abstract

Powerful deep neural networks are vulnerable to adversarial attacks. To obtain adversarially robust models, researchers have separately developed adversarial training and Jacobian regularization techniques. There are abundant theoretical and empirical studies for adversarial training, but theoretical foundations for Jacobian regularization are still lacking. In this study, we show that Jacobian regularization is closely related to adversarial training in that $\ell_{2}$ or $\ell_{1}$ Jacobian regularized loss serves as an approximate upper bound on the adversarially robust loss under $\ell_{2}$ or $\ell_{\infty}$ adversarial attack respectively. Further, we establish the robust generalization gap for Jacobian regularized risk minimizer via bounding the Rademacher complexity of both the standard loss function class and Jacobian regularization function class. Our theoretical results indicate that the norms of Jacobian are related to both standard and robust generalization. We also perform experiments on MNIST data classification to demonstrate that Jacobian regularized risk minimization indeed serves as a surrogate for adversarially robust risk minimization, and that reducing the norms of Jacobian can improve both standard and robust generalization. This study promotes both theoretical and empirical understandings to adversarially robust generalization via Jacobian regularization.

Adversarially robust generalization theory via Jacobian regularization for deep neural networks

TL;DR

This work theoretically ties Jacobian regularization to adversarial robustness by showing that Jacobian-based surrogate losses and upper-bound the first-order robust loss under and perturbations. It derives robust generalization bounds via Rademacher complexity and covering-number techniques that apply to vector-valued models, and introduces two Jacobian regularizations, and , to control robustness while remaining computationally viable. The paper proves that reducing Jacobian norms tightens the robust generalization gap and demonstrates through MNIST experiments that Jacobian-regularized risk minimization can serve as an effective surrogate for adversarially robust risk minimization, improving both standard and robust generalization. These results bridge theory and practice, offering a principled path to robust generalization via Jacobian regulation in deep networks.

Abstract

Powerful deep neural networks are vulnerable to adversarial attacks. To obtain adversarially robust models, researchers have separately developed adversarial training and Jacobian regularization techniques. There are abundant theoretical and empirical studies for adversarial training, but theoretical foundations for Jacobian regularization are still lacking. In this study, we show that Jacobian regularization is closely related to adversarial training in that or Jacobian regularized loss serves as an approximate upper bound on the adversarially robust loss under or adversarial attack respectively. Further, we establish the robust generalization gap for Jacobian regularized risk minimizer via bounding the Rademacher complexity of both the standard loss function class and Jacobian regularization function class. Our theoretical results indicate that the norms of Jacobian are related to both standard and robust generalization. We also perform experiments on MNIST data classification to demonstrate that Jacobian regularized risk minimization indeed serves as a surrogate for adversarially robust risk minimization, and that reducing the norms of Jacobian can improve both standard and robust generalization. This study promotes both theoretical and empirical understandings to adversarially robust generalization via Jacobian regularization.

Paper Structure

This paper contains 9 sections, 13 theorems, 92 equations, 1 figure, 2 tables.

Key Result

Proposition 1

Bartlett2003RademacherAG Suppose the loss function $\ell(f(x),y)$ is bounded between $[0,B]$. Then for any $\delta \in (0,1)$, with probability at least $1-\delta$, the following inequality holds for all $f \in \mathcal{F}$,

Figures (1)

  • Figure 1: Comparison of Jacobian regularized loss and adversarially robust loss. We use the PGD attack loss to estimate the adversarially robust loss. In the figure legend, the losses represents the Jacobian regularized loss and the adv losses represents the adversarially robust loss.

Theorems & Definitions (25)

  • Definition 1: Rademacher complexity
  • Proposition 1
  • Proposition 2
  • Lemma 1
  • Lemma 2
  • Proposition 3
  • Lemma 3
  • Definition 2: Covering number
  • Lemma 4
  • Lemma 5
  • ...and 15 more