Efficient Optimization Algorithms for Linear Adversarial Training
Antônio H. RIbeiro, Thomas B. Schön, Dave Zahariah, Francis Bach
TL;DR
This work provides scalable solvers for adversarial training of linear models by exploiting problem structure. It introduces an augmented variable smooth reformulation for classification with projected gradient methods and an iterative reweighted ridge approach for regression, complemented by practical enhancements such as momentum, line search, and variance reduction. The methods yield provable convergence behavior and demonstrate significant speedups over general-purpose solvers like CVXPY while maintaining competitive predictive and adversarial robustness on diverse data. The results suggest that linear adversarial training can be a practical alternative in high-dimensional settings, with potential extensions to multiclass and nonlinear regimes. The accompanying codebase supports reproducibility and further exploration in large-scale applications such as genetics and high-throughput phenotyping.
Abstract
Adversarial training can be used to learn models that are robust against perturbations. For linear models, it can be formulated as a convex optimization problem. Compared to methods proposed in the context of deep learning, leveraging the optimization structure allows significantly faster convergence rates. Still, the use of generic convex solvers can be inefficient for large-scale problems. Here, we propose tailored optimization algorithms for the adversarial training of linear models, which render large-scale regression and classification problems more tractable. For regression problems, we propose a family of solvers based on iterative ridge regression and, for classification, a family of solvers based on projected gradient descent. The methods are based on extended variable reformulations of the original problem. We illustrate their efficiency in numerical examples.
