DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
Yifan Wu, Jiawei Du, Ping Liu, Yuewei Lin, Wei Xu, Wenqing Cheng
TL;DR
DD-RobustBench presents the first comprehensive benchmark for evaluating the adversarial robustness of distilled datasets across diverse distillation methods, attacks, and dataset scales, including large-scale ImageNet-1K subsets. It demonstrates that distilled data can offer superior robustness under many conditions, though robustness interacts with the compression ratio IPC and distillation components such as data augmentation and downsampling. The work provides a unified evaluation pipeline, critical insights into how different distillation elements affect robustness, and practical guidance for deploying and extending the benchmark. The benchmark thus offers a valuable tool for advancing robust dataset distillation in real-world applications.
Abstract
Dataset distillation is an advanced technique aimed at compressing datasets into significantly smaller counterparts, while preserving formidable training performance. Significant efforts have been devoted to promote evaluation accuracy under limited compression ratio while overlooked the robustness of distilled dataset. In this work, we introduce a comprehensive benchmark that, to the best of our knowledge, is the most extensive to date for evaluating the adversarial robustness of distilled datasets in a unified way. Our benchmark significantly expands upon prior efforts by incorporating a wider range of dataset distillation methods, including the latest advancements such as TESLA and SRe2L, a diverse array of adversarial attack methods, and evaluations across a broader and more extensive collection of datasets such as ImageNet-1K. Moreover, we assessed the robustness of these distilled datasets against representative adversarial attack algorithms like PGD and AutoAttack, while exploring their resilience from a frequency perspective. We also discovered that incorporating distilled data into the training batches of the original dataset can yield to improvement of robustness.
