Information Geometry of Exponentiated Gradient: Convergence beyond L-Smoothness
Yara Elshiaty, Ferdinand Vanmaele, Stefania Petra
TL;DR
The paper addresses minimizing smooth, potentially nonconvex functions on the positive orthant, motivated by Poisson inverse problems. It reframes exponentiated gradient (EG) as Riemannian gradient descent on the Poisson geometry using the $e$-Exp retraction, enabling a Riemannian Armijo line search. The authors prove global convergence under weak conditions and establish finite termination of the line search, highlighting the KL-divergence’s self-concordant-like role in the analysis. Numerical experiments in tomographic reconstruction show that EG with Armijo line search converges faster than interior-point geometry-based RGD, and an accelerated EG variant via geometric CG demonstrates practical benefits.
Abstract
We study the minimization of smooth, possibly nonconvex functions over the positive orthant, a key setting in Poisson inverse problems, using the exponentiated gradient (EG) method. Interpreting EG as Riemannian gradient descent (RGD) with the $e$-Exp map from information geometry as a retraction, we prove global convergence under weak assumptions -- without the need for $L$-smoothness -- and finite termination of Riemannian Armijo line search. Numerical experiments, including an accelerated variant, highlight EG's practical advantages, such as faster convergence compared to RGD based on interior-point geometry.
