Relaxed Weak Accelerated Proximal Gradient Method: a Unified Framework for Nesterov's Accelerations
Hongda Li, Xianfu Wang
TL;DR
This work addresses accelerated proximal gradient methods for $F(x)= f(x) + g(x)$ by introducing Relaxed Weak Accelerated Proximal Gradient (R-WAPG), which permits momentum sequences that do not strictly follow Nesterov's rule. The authors develop a unified convergence framework using two sequences $(b1_k)$ and $(c1_k)$ and derive bounds that encompass both convex and strongly convex settings, including reductions to FISTA and V-FISTA under special choices. They further present three equivalent representations of R-WAPG, connect it to existing acceleration schemes, and introduce Free R-WAPG, a parameter-free variant that estimates problem constants online. Numerical experiments on simple quadratic problems and LASSO show competitive performance of FR-WAPG, highlighting the practical viability of a parameter-free, non-restarting acceleration framework. The study generalizes acceleration theory and offers a flexible toolkit for analyzing and implementing proximal gradient methods across a spectrum of convex objectives.
Abstract
This paper is devoted to the study of accelerated proximal gradient methods where the sequence that controls the momentum term doesn't follow Nesterov's rule. We propose a relaxed weak accelerated proximal gradient (R-WAPG) method, a generic algorithm that unifies the convergence results for strongly convex and convex problems where the extrapolation constant is characterized by a sequence that is much weaker than Nesterov's rule. Our R-WAPG provides a unified framework for several notable Euclidean variants of FISTA and verifies their convergences. In addition, we provide the convergence rate of strongly convex objective with a constant momentum term. Without using the idea of restarting, we also reformulate R-WAPG as ``Free R-WAPG" so that it doesn't require any parameter. Explorative numerical experiments were conducted to show its competitive advantages.
