Data-Driven Performance Guarantees for Classical and Learned Optimizers
Rajiv Sambharya, Bartolomeo Stellato
TL;DR
The paper develops a data-driven framework for performance guarantees of optimization algorithms under parametric problem distributions. It provides probabilistic bounds for classical fixed-point optimizers via a sample-convergence bound and PAC-Bayes-based generalization bounds for learned optimizers, including a gradient-based training objective to minimize the PAC-Bayes bound itself. The approach is validated across domains such as image deblurring, robust Kalman filtering, sparse coding (LISTA variants), warm starts, and MAML, showing tighter probabilistic guarantees and, in several cases, bounds that beat observed empirical outcomes. This work offers a practical pathway to reliable, data-informed guarantees for both traditional and learned optimization strategies in settings with fixed iteration budgets. The framework facilitates calibrated, task-aware guarantees that can inform algorithm selection and deployment in signal processing, control, and meta-learning applications.
Abstract
We introduce a data-driven approach to analyze the performance of continuous optimization algorithms using generalization guarantees from statistical learning theory. We study classical and learned optimizers to solve families of parametric optimization problems. We build generalization guarantees for classical optimizers, using a sample convergence bound, and for learned optimizers, using the Probably Approximately Correct (PAC)-Bayes framework. To train learned optimizers, we use a gradient-based algorithm to directly minimize the PAC-Bayes upper bound. Numerical experiments in signal processing, control, and meta-learning showcase the ability of our framework to provide strong generalization guarantees for both classical and learned optimizers given a fixed budget of iterations. For classical optimizers, our bounds are much tighter than those that worst-case guarantees provide. For learned optimizers, our bounds outperform the empirical outcomes observed in their non-learned counterparts.
