The Easy Path to Robustness: Coreset Selection using Sample Hardness
Pranav Ramesh, Arjun Roy, Deepak Ravikumar, Kaushik Roy, Gopalakrishnan Srinivasan
TL;DR
This paper tackles adversarial robustness from a data-centric viewpoint by introducing EasyCore, a static coreset built from low-AIGN samples to promote smoother decision boundaries. The core idea is that samples with smaller Average Input Gradient Norm, $AIGN$, are prototypical and learned quickly, leading to greater robustness under both standard and TRADES adversarial training. Across CIFAR-10/100 and ImageNet-1K, EasyCore yields up to 7% and 5% improvements in adversarial accuracy, respectively, compared with existing coreset methods. The method is model-agnostic and efficient because AIGN is a dataset property that can be computed once, and EasyCore guides data selection rather than altering training dynamics.
Abstract
Designing adversarially robust models from a data-centric perspective requires understanding which input samples are most crucial for learning resilient features. While coreset selection provides a mechanism for efficient training on data subsets, current algorithms are designed for clean accuracy and fall short in preserving robustness. To address this, we propose a framework linking a sample's adversarial vulnerability to its \textit{hardness}, which we quantify using the average input gradient norm (AIGN) over training. We demonstrate that \textit{easy} samples (with low AIGN) are less vulnerable and occupy regions further from the decision boundary. Leveraging this insight, we present EasyCore, a coreset selection algorithm that retains only the samples with low AIGN for training. We empirically show that models trained on EasyCore-selected data achieve significantly higher adversarial accuracy than those trained with competing coreset methods under both standard and adversarial training. As AIGN is a model-agnostic dataset property, EasyCore is an efficient and widely applicable data-centric method for improving adversarial robustness. We show that EasyCore achieves up to 7\% and 5\% improvement in adversarial accuracy under standard training and TRADES adversarial training, respectively, compared to existing coreset methods.
