A New Lineserach for Accelerated Composite Minimization
Reza Rahimi Baghbadorani, Sergio Grammatico, Peyman Mohajerin Esfahani
TL;DR
This work addresses the long-standing challenge of selecting stepsizes in first-order convex optimization without relying on a known global smoothness constant. It introduces a novel zero-order linesearch that relies only on function evaluations, applied to both non-accelerated and accelerated gradient methods through a gradient-mapping framework for composite objectives. The authors prove convergence guarantees, achieving O(1/k) for non-accelerated and O(1/k^2) for accelerated schemes, and demonstrate near-optimal performance on smooth, composite, and non-convex problems. The approach is hyperparameter-free for the composite setting and shows strong empirical performance across diverse problem classes, suggesting broad practical impact for large-scale optimization tasks.
Abstract
The choice of the stepsize in first-order convex optimization is typically based on the smoothness constant and plays a crucial role in the performance of algorithms. Recently, there has been a resurgent interest in introducing adaptive stepsizes that do not explicitly depend on smooth constant. In this paper, we propose a novel linesearch stepsize rule based on function evaluations (i.e., zero-order information) that enjoys provable convergence guarantees for both accelerated and non-accelerated gradient descent. We further discuss the similarities and differences between the proposed stepsize regimes and the existing stepsize rules (including Polyak and Armijo). We numerically benchmark the performance of our proposed algorithms against state-of-the-art methods across three major problems classes of (1) smooth minimization (logistic regression, quadratic programs, log-sum-exponential, and smooth max-cut relaxation) (2) composite minimization ($\ell_1$-regularized least-squares, $\ell_1$-constrained least-squares, and $\ell_1$-regularized logistic regression), and (3) non-convex minimization (cubic minimization). These classes include a wide range of operations research and management applications such as portfolio optimization, discrete choice models, sparse classification and feature selections, high-order optimization and trust-region subproblems.
