Approximate Bregman proximal gradient algorithm with variable metric Armijo--Wolfe line search
Kiwamu Fujiki, Shota Takahashi, Akiko Takeda
TL;DR
This work addresses composite nonconvex optimization of the form $\min_x f(x)+g(x)$ with $f$ smooth and potentially non-Lipschitz, proposing the ABPG-VMAW method that combines an approximate Bregman proximal step with a variable-metric Armijo--Wolfe line search. The authors establish global subsequential convergence and, under the KL property, global convergence to a stationary point, even when $g$ is nonzero. They demonstrate practical effectiveness on $\ell_p$ regularized least squares and nonnegative linear inverse problems, where ABPG-VMAW outperforms ABPG and standard proximal gradient methods. The results advance ABPG-type algorithms by enabling larger, more robust steps while preserving convergence guarantees, with implications for efficiency in signal processing and machine learning applications that involve nonconvex penalties and constraints.
Abstract
We propose a variant of the approximate Bregman proximal gradient (ABPG) algorithm for minimizing the sum of a smooth nonconvex function and a nonsmooth convex function. Although ABPG is known to converge globally to a stationary point even when the smooth part of the objective function lacks globally Lipschitz continuous gradients, and its iterates can often be expressed in closed form, ABPG relies on an Armijo line search to guarantee global convergence. Such reliance can slow down performance in practice. To overcome this limitation, we propose the ABPG with a variable metric Armijo--Wolfe line search. Under the variable metric Armijo--Wolfe condition, we establish the global subsequential convergence of our algorithm. Moreover, assuming the Kurdyka--Łojasiewicz property, we also establish that our algorithm globally converges to a stationary point. Numerical experiments on $\ell_p$ regularized least squares problems and nonnegative linear inverse problems demonstrate that our algorithm outperforms existing algorithms.
