An efficient algorithm for the Riemannian logarithm on the Stiefel manifold for a family of Riemannian metrics
Simon Mataigne, Ralf Zimmermann, Nina Miolane
TL;DR
This work tackles the problem of computing the Riemannian logarithm on the Stiefel manifold under a one-parameter family of metrics that includes the Euclidean and canonical cases. It generalizes Zimmermann's efficient matrix-algebraic geodesic algorithm to the $\beta$-metric family, presenting backward and forward iterations, with rigorous local linear convergence results and practical forward variants that avoid heavy nonlinear matrix solves. The paper introduces a BCH-based analysis, efficient initializations, and a quasi-geodesic subproblem framework, accompanied by extensive numerical experiments showing that the accelerated forward variant often outperforms shooting-based methods, especially for a wide range of $\beta$. The work provides reproducible code and benchmarks, and offers insights into convergence radii via probabilistic modeling, highlighting the method's potential for robust and scalable geodesic computations on the Stiefel manifold in optimization, statistics, and machine learning contexts.
Abstract
Since the popularization of the Stiefel manifold for numerical applications in 1998 in a seminal paper from Edelman et al., it has been exhibited to be a key to solve many problems from optimization, statistics and machine learning. In 2021, Hüper et al. proposed a one-parameter family of Riemannian metrics on the Stiefel manifold, subsuming the well-known Euclidean and canonical metrics. Since then, several methods have been proposed to obtain a candidate for the Riemannian logarithm given any metric from the family. Most of these methods are based on the shooting method or rely on optimization approaches. For the canonical metric, Zimmermann proposed in 2017 a particularly efficient method based on a pure matrix-algebraic approach. In this paper, we derive a generalization of this algorithm that works for the one-parameter family of Riemannian metrics. The algorithm is proposed in two versions, termed backward and forward, for which we prove that it conserves the local linear convergence previously exhibited in Zimmermann's algorithm for the canonical metric.
