Towards Universal Certified Robustness with Multi-Norm Training

Enyi Jiang; David S. Cheung; Gagandeep Singh

Towards Universal Certified Robustness with Multi-Norm Training

Enyi Jiang, David S. Cheung, Gagandeep Singh

TL;DR

This work addresses the gap in certified robustness across multiple perturbation norms by introducing CURE, a deterministic multi-norm certified training framework designed to achieve union and universal robustness. It formalizes the l_q–l_r trade-off, develops bound-alignment and natural-training integration techniques, and proposes several training schemes (CURE-Joint, CURE-Max, CURE-Random) plus bound-alignment and quick certified fine-tuning to leverage pre-trained single-norm models. Empirical results on MNIST, CIFAR-10, and TinyImagenet show substantial gains in union robustness (e.g., up to 32.0% on MNIST, 25.8% on CIFAR-10, 10.6% on TinyImagenet) and improved robustness to unseen geometric/patch perturbations. These findings advance toward universal certified robustness by enabling efficient multi-norm training and effective fine-tuning of pre-trained models.

Abstract

Existing certified training methods can only train models to be robust against a certain perturbation type (e.g. $l_\infty$ or $l_2$). However, an $l_\infty$ certifiably robust model may not be certifiably robust against $l_2$ perturbation (and vice versa) and also has low robustness against other perturbations (e.g. geometric and patch transformation). By constructing a theoretical framework to analyze and mitigate the tradeoff, we propose the first multi-norm certified training framework \textbf{CURE}, consisting of several multi-norm certified training methods, to attain better \emph{union robustness} when training from scratch or fine-tuning a pre-trained certified model. Inspired by our theoretical findings, we devise bound alignment and connect natural training with certified training for better union robustness. Compared with SOTA-certified training, \textbf{CURE} improves union robustness to $32.0\%$ on MNIST, $25.8\%$ on CIFAR-10, and $10.6\%$ on TinyImagenet across different epsilon values. It leads to better generalization on a diverse set of challenging unseen geometric and patch perturbations to $6.8\%$ and $16.0\%$ on CIFAR-10. Overall, our contributions pave a path towards \textit{universal certified robustness}.

Towards Universal Certified Robustness with Multi-Norm Training

TL;DR

Abstract

Existing certified training methods can only train models to be robust against a certain perturbation type (e.g.

). However, an

certifiably robust model may not be certifiably robust against

perturbation (and vice versa) and also has low robustness against other perturbations (e.g. geometric and patch transformation). By constructing a theoretical framework to analyze and mitigate the tradeoff, we propose the first multi-norm certified training framework \textbf{CURE}, consisting of several multi-norm certified training methods, to attain better \emph{union robustness} when training from scratch or fine-tuning a pre-trained certified model. Inspired by our theoretical findings, we devise bound alignment and connect natural training with certified training for better union robustness. Compared with SOTA-certified training, \textbf{CURE} improves union robustness to

on MNIST,

on CIFAR-10, and

on TinyImagenet across different epsilon values. It leads to better generalization on a diverse set of challenging unseen geometric and patch perturbations to

and

on CIFAR-10. Overall, our contributions pave a path towards \textit{universal certified robustness}.

Paper Structure (29 sections, 2 theorems, 19 equations, 8 figures, 11 tables, 6 algorithms)

This paper contains 29 sections, 2 theorems, 19 equations, 8 figures, 11 tables, 6 algorithms.

Introduction
Related Work
Background
Neural network verification
Training for robustness
Certified training
Evaluation metrics
CURE: multi-norm Certified training for Union RobustnEss
Notations
Robust, boundary and union error
Trade-off of $l_q, l_r$ perturbations
Certified training for multiple norms
Improved multi-norm certified training
Bound alignment (BA)
Integrate NT into CT
...and 14 more sections

Key Result

Lemma 4.1

Binary IBP loss is a logistic loss in the classification-calibrated surrogate loss family.

Figures (8)

Figure 1: (a) $l_\infty - l_2$ tradeoff: an $l_\infty$ certified robust model may lack $l_2$ certified robustness and vice versa. CURE-Scratch (yellow) and CURE-Finetune (green) improve union robustness significantly. (b) We align the output bound differences for $l_q, l_r$ perturbations on the correctly certified $l_q$ subset $\gamma$ to mitigate $l_q - l_r$ tradeoff for better union robustness.
Figure 2: $l_q-l_r$ trade-off visualization. The large rectangle represents all input image instances. Blue and Purple points are instances belonging to $\mathcal{R}_q$ and $\mathcal{R}_r$, respectively.
Figure 3: Comparison on CURE against geometric transformations for MNIST $(\epsilon_1=2.0, \epsilon_2=1.0, \epsilon_\infty=0.3)$ experiment. We denote R$(\varphi)$ a rotation of $\pm \varphi$ degrees; Tu$(\Delta u)$ and Tv$(\Delta v)$ a translation of $\pm \Delta u$ pixels horizontally and $\pm \Delta v$ pixels vertically, respectively; Sc$(\lambda)$ a scaling of $\pm \lambda \%$; Sh$(\gamma)$ a shearing of $\pm \gamma \%$; C$(\alpha)$ a contrast change of $\pm \alpha \%$; and B$(\beta)$ a brightness change of $\pm \beta$. CURE improves the geometric certified robustness compared with single norm training. Also, CURE-Scratch achieves the best average geometric transformation robustness.
Figure 4: CURE-Max and CURE-Scratch bound difference visualization.
Figure 5: Alabtion studies on $\lambda_2$, $\eta$ and $\beta$ hyper-parameters.
...and 3 more figures

Theorems & Definitions (4)

Lemma 4.1
Theorem 4.2
Definition 4.3: Correctly Certified $l_r$ Subset
proof

Towards Universal Certified Robustness with Multi-Norm Training

TL;DR

Abstract

Towards Universal Certified Robustness with Multi-Norm Training

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (4)