Robust Dataset Distillation by Matching Adversarial Trajectories

Wei Lai; Tianyu Ding; ren dongdong; Lei Wang; Jing Huo; Yang Gao; Wenbin Li

Robust Dataset Distillation by Matching Adversarial Trajectories

Wei Lai, Tianyu Ding, ren dongdong, Lei Wang, Jing Huo, Yang Gao, Wenbin Li

TL;DR

This work introduces Robust Dataset Distillation by Matching Adversarial Trajectories (MAT), a framework that embeds adversarial robustness directly into distilled data by aligning smoothed adversarial training trajectories with a teacher trajectory. MAT uses exponential moving average (EMA) smoothing to tame rapid weight changes during adversarial training, enabling effective trajectory matching and robust data synthesis. Experiments on CIFAR-10, CIFAR-100, and Tiny ImageNet show that models trained on MAT-distilled data achieve enhanced adversarial robustness with competitive clean accuracy, across multiple adversarial training regimes (PGD-AT, TRADES, MART) and architectures. The results establish robust dataset distillation as a viable, efficient path to reliable robust learning without the overhead of per-iteration adversarial training on large datasets.

Abstract

Dataset distillation synthesizes compact datasets that enable models to achieve performance comparable to training on the original large-scale datasets. However, existing distillation methods overlook the robustness of the model, resulting in models that are vulnerable to adversarial attacks when trained on distilled data. To address this limitation, we introduce the task of ``robust dataset distillation", a novel paradigm that embeds adversarial robustness into the synthetic datasets during the distillation process. We propose Matching Adversarial Trajectories (MAT), a method that integrates adversarial training into trajectory-based dataset distillation. MAT incorporates adversarial samples during trajectory generation to obtain robust training trajectories, which are then used to guide the distillation process. As experimentally demonstrated, even through natural training on our distilled dataset, models can achieve enhanced adversarial robustness while maintaining competitive accuracy compared to existing distillation methods. Our work highlights robust dataset distillation as a new and important research direction and provides a strong baseline for future research to bridge the gap between efficient training and adversarial robustness.

Robust Dataset Distillation by Matching Adversarial Trajectories

TL;DR

Abstract

Robust Dataset Distillation by Matching Adversarial Trajectories

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)