Provable Robustness of ReLU networks via Maximization of Linear Regions
Francesco Croce, Maksym Andriushchenko, Matthias Hein
TL;DR
The authors tackle provable robustness for ReLU networks by exploiting the piecewise affine structure to define linear regions and decision boundaries. They derive robustness guarantees using distances to region and decision boundaries, and introduce the Maximum Margin Regularizer (MMR) to systematically enlarge linear regions and margins during training. Empirically, MMR improves both lower and upper robustness bounds and enhances verifiability via faster MIP certification, often matching or surpassing adversarial training baselines. The work also enables obtaining guaranteed optimal adversarial perturbations for a substantial fraction of inputs, demonstrating practical impact for certifiable robustness in safety-critical settings.
Abstract
It has been shown that neural network classifiers are not robust. This raises concerns about their usage in safety-critical systems. We propose in this paper a regularization scheme for ReLU networks which provably improves the robustness of the classifier by maximizing the linear regions of the classifier as well as the distance to the decision boundary. Our techniques allow even to find the minimal adversarial perturbation for a fraction of test points for large networks. In the experiments we show that our approach improves upon adversarial training both in terms of lower and upper bounds on the robustness and is comparable or better than the state-of-the-art in terms of test error and robustness.
