Certified Robustness via Dynamic Margin Maximization and Improved Lipschitz Regularization
Mahyar Fazlyab, Taha Entesari, Aniket Roy, Rama Chellappa
TL;DR
This work tackles adversarial robustness by targeting the input-space margin directly rather than solely maximizing the output margin. It introduces CRM, a training framework that couples a differentiable regularizer based on logit-difference Lipschitz constants with a scalable Lipschitz upper-bound estimator, LipLT, to shape the decision boundary efficiently. A key contribution is the derivation of Lipschitz-based surrogates for the certified radius, along with a loop-transformation technique that tightens Lipschitz bounds and scales to multi-layer networks. Empirical results on MNIST, CIFAR-10, and Tiny-ImageNet show competitive or superior certified robustness and improved Lipschitz estimation, with a practical, GPU-friendly implementation. Overall, the paper provides a principled, scalable path to end-to-end robust training with certified guarantees.
Abstract
To improve the robustness of deep classifiers against adversarial perturbations, many approaches have been proposed, such as designing new architectures with better robustness properties (e.g., Lipschitz-capped networks), or modifying the training process itself (e.g., min-max optimization, constrained learning, or regularization). These approaches, however, might not be effective at increasing the margin in the input (feature) space. As a result, there has been an increasing interest in developing training procedures that can directly manipulate the decision boundary in the input space. In this paper, we build upon recent developments in this category by developing a robust training algorithm whose objective is to increase the margin in the output (logit) space while regularizing the Lipschitz constant of the model along vulnerable directions. We show that these two objectives can directly promote larger margins in the input space. To this end, we develop a scalable method for calculating guaranteed differentiable upper bounds on the Lipschitz constant of neural networks accurately and efficiently. The relative accuracy of the bounds prevents excessive regularization and allows for more direct manipulation of the decision boundary. Furthermore, our Lipschitz bounding algorithm exploits the monotonicity and Lipschitz continuity of the activation layers, and the resulting bounds can be used to design new layers with controllable bounds on their Lipschitz constant. Experiments on the MNIST, CIFAR-10, and Tiny-ImageNet data sets verify that our proposed algorithm obtains competitively improved results compared to the state-of-the-art.
