Learning Acceleration Algorithms for Fast Parametric Convex Optimization with Certified Robustness
Rajiv Sambharya, Jinho Bok, Nikolai Matni, George Pappas
TL;DR
This work tackles fast, robust solution of parametric convex optimization by learning acceleration hyperparameters for momentum-based first-order methods within a finite iteration budget. It unites gradient-based training with a robustness-oriented objective derived from the Performance Estimation Problem (PEP) framework, yielding worst-case guarantees over a parameter set while maintaining empirical performance on distributional samples. The method supports accelerated gradient, proximal gradient, and ADMM-based solvers (including OSQP and SCS), with time-varying iteration weights and time-invariant ADMM parameters. Empirical results across logistic regression, sparse coding, nonnegative least squares, MPC for a quadcopter, and robust Kalman filtering show substantial speedups and strong finite-iteration robustness, even when trained on as few as ten instances. This data-efficient, SDP-backed approach opens practical pathways for robust, fast parametric optimization in control, signal processing, and statistics.
Abstract
We develop a machine-learning framework to learn hyperparameter sequences for accelerated first-order methods (e.g., the step size and momentum sequences in accelerated gradient descent) to quickly solve parametric convex optimization problems with certified robustness. We obtain a strong form of robustness guarantee -- certification of worst-case performance over all parameters within a set after a given number of iterations -- through regularization-based training. The regularization term is derived from the performance estimation problem (PEP) framework based on semidefinite programming, in which the hyperparameters appear as problem data. We show how to use gradient-based training to learn the hyperparameters for several first-order methods: accelerated versions of gradient descent, proximal gradient descent, and alternating direction method of multipliers. Through various numerical examples from signal processing, control, and statistics, we demonstrate that the quality of the solution can be dramatically improved within a budget of iterations, while also maintaining strong robustness guarantees. Notably, our approach is highly data-efficient in that we only use ten training instances in all of the numerical examples.
