Adaptive Backtracking Line Search
Joao V. Cavalcanti, Laurent Lessard, Ashia C. Wilson
TL;DR
Adaptive Backtracking Line Search introduces an online, violation-aware step-size factor $\hat{\rho}(v(\alpha_k))$ to replace the fixed backtracking scale in Armijo and descent-lemma line searches, achieving faster iterations with no additional computation. The authors provide convex-case guarantees that ABLS uses no more function evaluations than standard backtracking and extend global convergence guarantees to nonconvex smooth problems, preserving GD and AGD convergence rates. Empirical results across logistic regression, linear inverse problems, Rosenbrock, and matrix factorization demonstrate reduced function/gradient evaluations and shorter runtimes. The work offers a broadly applicable template for adaptive line search that can speed up optimization in deterministic and proximal settings without sacrificing theoretical guarantees.
Abstract
Backtracking line search is foundational in numerical optimization. The basic idea is to adjust the step-size of an algorithm by a constant factor until some chosen criterion (e.g. Armijo, Descent Lemma) is satisfied. We propose a novel way to adjust step-sizes, replacing the constant factor used in regular backtracking with one that takes into account the degree to which the chosen criterion is violated, with no additional computational burden. This light-weight adjustment leads to significantly faster optimization, which we confirm by performing a variety of experiments on over fifteen real world datasets. For convex problems, we prove adaptive backtracking requires no more adjustments to produce a feasible step-size than regular backtracking does. For nonconvex smooth problems, we prove adaptive backtracking enjoys the same guarantees of regular backtracking. Furthermore, we prove adaptive backtracking preserves the convergence rates of gradient descent and its accelerated variant.
