Mirror Descent on Riemannian Manifolds

Jiaxin Jiang; Lei Shi; Jiyuan Tan

Mirror Descent on Riemannian Manifolds

Jiaxin Jiang, Lei Shi, Jiyuan Tan

Abstract

Mirror Descent (MD) is a scalable first-order method widely used in large-scale optimization, with applications in image processing, policy optimization, and neural network training. This paper generalizes MD to optimization on Riemannian manifolds. In particular, we develop a Riemannian Mirror Descent (RMD) framework via reparameterization and further propose a stochastic variant of RMD. We also establish non-asymptotic convergence guarantees for both RMD and stochastic RMD. As an application to the Stiefel manifold, our RMD framework reduces to the Curvilinear Gradient Descent (CGD) method proposed in [26]. Moreover, when specializing the stochastic RMD framework to the Stiefel setting, we obtain a stochastic extension of CGD, which effectively addresses large-scale manifold optimization problems.

Mirror Descent on Riemannian Manifolds

Abstract

Paper Structure (11 sections, 6 theorems, 77 equations, 2 tables, 2 algorithms)

This paper contains 11 sections, 6 theorems, 77 equations, 2 tables, 2 algorithms.

Introduction
Notation and Theoretical Background
Riemannian Mirror Descent: Algorithms and Convergence Theorems
Motivation and Algorithm Framework
Examples of Riemannian Mirror Descent
Stochastic Riemannian Mirror Descent
Convergence Analysis
Numerical Experiments
Linear Eigenvalue Problem
Orthogonal Procrustes Problem
Conclusion

Key Result

Theorem 1

For RMD updates in Algorithm algo:md_rie, assume that Assumptions asp:Phi--asp:bounded_gradient hold. Let $\eta_t\equiv \eta$ be a constant step size satisfying Then the following statements hold:

Theorems & Definitions (23)

definition 1
definition 2
definition 3
definition 4
definition 5
definition 6
definition 7
definition 8
definition 9
remark 1
...and 13 more

Mirror Descent on Riemannian Manifolds

Abstract

Mirror Descent on Riemannian Manifolds

Authors

Abstract

Table of Contents

Key Result

Theorems & Definitions (23)