Towards Adversarially Robust Dataset Distillation by Curvature Regularization

Eric Xue; Yijiang Li; Haoyang Liu; Peiran Wang; Yifan Shen; Haohan Wang

Towards Adversarially Robust Dataset Distillation by Curvature Regularization

Eric Xue, Yijiang Li, Haoyang Liu, Peiran Wang, Yifan Shen, Haohan Wang

TL;DR

The paper tackles the challenge of embedding adversarial robustness into dataset distillation without incurring the heavy cost of traditional adversarial training. It derives a theoretical bound showing that the upper bound on adversarial loss for distilled data is governed by the curvature of the loss surface, and proposes GUARD, a curvature-regularization strategy, to flatten the loss landscape during distillation. GUARD is integrated into existing DD methods (e.g., SRe2L, DC) and achieves improved robustness with minimal overhead across ImageNette, Tiny ImageNet, and ImageNet-1K, often improving clean accuracy as a byproduct. The work provides robustness guarantees relative to the real data distribution and demonstrates transferability to multiple distillation approaches, suggesting practical impact for efficient, robust DD in large-scale vision tasks.

Abstract

Dataset distillation (DD) allows datasets to be distilled to fractions of their original size while preserving the rich distributional information, so that models trained on the distilled datasets can achieve a comparable accuracy while saving significant computational loads. Recent research in this area has been focusing on improving the accuracy of models trained on distilled datasets. In this paper, we aim to explore a new perspective of DD. We study how to embed adversarial robustness in distilled datasets, so that models trained on these datasets maintain the high accuracy and meanwhile acquire better adversarial robustness. We propose a new method that achieves this goal by incorporating curvature regularization into the distillation process with much less computational overhead than standard adversarial training. Extensive empirical experiments suggest that our method not only outperforms standard adversarial training on both accuracy and robustness with less computation overhead but is also capable of generating robust distilled datasets that can withstand various adversarial attacks. Our implementation is available at: https://github.com/yumozi/GUARD.

Towards Adversarially Robust Dataset Distillation by Curvature Regularization

TL;DR

Abstract

Paper Structure (29 sections, 1 theorem, 18 equations, 3 figures, 7 tables)

This paper contains 29 sections, 1 theorem, 18 equations, 3 figures, 7 tables.

Introduction
Related Works
Dataset Distillation
Adversarial Attacks
Adversarial Defense
Preliminary
Dataset Distillation
Notations
The Limitation of Adversarial Training in Dataset Distillation
Methods
Formulation of the Robust Distillation Problem
Theoretical Bound of Robustness
Geometric Regularization for Adversarially Robust Dataset
Engineering Specification
Experiments
...and 14 more sections

Key Result

Proposition 1

Let $\mathbf{x}^\prime$ be a distilled datum with the label $c$ and satisfies $\|h(\mathbf{x}^\prime) - \mathbb{E}_{\mathbf{x} \sim D_c}[h(\mathbf{x})]\| \le \sigma$, where $h(\cdot)$ is a feature extractor. Assume $\ell(\cdot)$ is convex in $\mathbf{x}$ and $\tilde{\ell}_\rho^{adv}(\cdot)$ is $L$-L

Figures (3)

Figure 1: Visualization of distilled images generated using GUARD with 1 IPC setting from ImageNet.
Figure 2: A comparison between the curvature profiles of a baseline dataset distillation method (left) and GUARD (right) in the form of sorted eigenvalues of the hessian
Figure : GUARD

Theorems & Definitions (1)

Proposition 1

Towards Adversarially Robust Dataset Distillation by Curvature Regularization

TL;DR

Abstract

Towards Adversarially Robust Dataset Distillation by Curvature Regularization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (1)