Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent
Philip Doldo, Derek Everett, Amol Khanna, Andre T Nguyen, Edward Raff
TL;DR
The paper addresses the high computational cost of evaluating adversarial robustness with PGD under the $L_ty$ ball. It introduces PGD$_{CD}$, a simple cycle-detection based early-termination method that halts PGD when a perturbation repeats, preserving the exact robustness estimate while substantially reducing iterations. Empirical results across RobustBench models on ImageNet and CIFAR datasets show 10x–20x speedups with robustness estimates matching standard PGD, enabling scalable robustness evaluations and faster experimentation. The approach remains competitive against stronger attacks like Auto-PGD and facilitates acceleration in related tasks such as adversarial training, reducing compute and energy requirements.
Abstract
Projected Gradient Descent (PGD) under the $L_\infty$ ball has become one of the defacto methods used in adversarial robustness evaluation for computer vision (CV) due to its reliability and efficacy, making a strong and easy-to-implement iterative baseline. However, PGD is computationally demanding to apply, especially when using thousands of iterations is the current best-practice recommendation to generate an adversarial example for a single image. In this work, we introduce a simple novel method for early termination of PGD based on cycle detection by exploiting the geometry of how PGD is implemented in practice and show that it can produce large speedup factors while providing the \emph{exact} same estimate of model robustness as standard PGD. This method substantially speeds up PGD without sacrificing any attack strength, enabling evaluations of robustness that were previously computationally intractable.
