Table of Contents
Fetching ...

Symplectic Stiefel manifold: tractable metrics, second-order geometry and Newton's methods

Bin Gao, Nguyen Thanh Son, Tatjana Stykel

TL;DR

This is the first try to develop the explicit second-order geometry and Newton's methods on the symplectic Stiefel manifold and proposes a hybrid Riemannian Newton optimization algorithm that enjoys both global convergence and quadratic/superlinear local convergence at the final stage.

Abstract

Optimization under the symplecticity constraint is an approach for solving various problems in quantum physics and scientific computing. Building on the results that this optimization problem can be transformed into an unconstrained problem on the symplectic Stiefel manifold, we construct geometric ingredients for Riemannian optimization with a new family of Riemannian metrics called tractable metrics and develop Riemannian Newton schemes. The newly obtained ingredients do not only generalize several existing results but also provide us with freedom to choose a suitable metric for each problem. To the best of our knowledge, this is the first try to develop the explicit second-order geometry and Newton's methods on the symplectic Stiefel manifold. For the Riemannian Newton method, we first consider novel operator-valued formulas for computing the Riemannian Hessian of a~cost function, which further allows the manifold to be endowed with a weighted Euclidean metric that can provide a preconditioning effect. We then solve the resulting Newton equation, as the central step of Newton's methods, directly via transforming it into a~saddle point problem followed by vectorization, or iteratively via applying any matrix-free iterative method either to the operator Newton equation or its saddle point formulation. Finally, we propose a hybrid Riemannian Newton optimization algorithm that enjoys both global convergence and quadratic/superlinear local convergence at the final stage. Various numerical experiments are presented to validate the proposed methods.

Symplectic Stiefel manifold: tractable metrics, second-order geometry and Newton's methods

TL;DR

This is the first try to develop the explicit second-order geometry and Newton's methods on the symplectic Stiefel manifold and proposes a hybrid Riemannian Newton optimization algorithm that enjoys both global convergence and quadratic/superlinear local convergence at the final stage.

Abstract

Optimization under the symplecticity constraint is an approach for solving various problems in quantum physics and scientific computing. Building on the results that this optimization problem can be transformed into an unconstrained problem on the symplectic Stiefel manifold, we construct geometric ingredients for Riemannian optimization with a new family of Riemannian metrics called tractable metrics and develop Riemannian Newton schemes. The newly obtained ingredients do not only generalize several existing results but also provide us with freedom to choose a suitable metric for each problem. To the best of our knowledge, this is the first try to develop the explicit second-order geometry and Newton's methods on the symplectic Stiefel manifold. For the Riemannian Newton method, we first consider novel operator-valued formulas for computing the Riemannian Hessian of a~cost function, which further allows the manifold to be endowed with a weighted Euclidean metric that can provide a preconditioning effect. We then solve the resulting Newton equation, as the central step of Newton's methods, directly via transforming it into a~saddle point problem followed by vectorization, or iteratively via applying any matrix-free iterative method either to the operator Newton equation or its saddle point formulation. Finally, we propose a hybrid Riemannian Newton optimization algorithm that enjoys both global convergence and quadratic/superlinear local convergence at the final stage. Various numerical experiments are presented to validate the proposed methods.
Paper Structure (30 sections, 19 theorems, 143 equations, 5 figures, 7 tables, 3 algorithms)

This paper contains 30 sections, 19 theorems, 143 equations, 5 figures, 7 tables, 3 algorithms.

Key Result

proposition thmcounterproposition

The normal space to ${\mathrm{Sp}(2k,2n)}$ at $X\in{\mathrm{Sp}(2k,2n)}$ with respect to the metric $g_{{\bm M}_X}\!$ defined in eq:metricM can be represented as

Figures (5)

  • Figure 1: Geometric illustration of Riemannian ingredients on the symplectic Stiefel manifold
  • Figure 2: Symplectic solution of a matrix least squares problem with $n=50, k = 6$
  • Figure 3: Symplectic solution of a matrix least squares problem with $n=400, k = 10$
  • Figure 4: Symplectic trace minimization with synthetic data: $n=2000$, $k = 5$, $\texttt{tol} = 1\mathrm{e}-8$, $\theta = 1\mathrm{e}-3$
  • Figure 5: Symplectic trace minimization with data from a wire saw model: $n=2000$, $k = 5$, $\texttt{tol} = 1\mathrm{e}-8$, $\theta = 1\mathrm{e}-3$

Theorems & Definitions (37)

  • proposition thmcounterproposition
  • proof
  • proposition thmcounterproposition
  • proof
  • remark thmcounterremark: Preconditioning by metrics
  • theorem 1: Riemannian Hessian
  • lemma thmcounterlemma
  • proof
  • lemma thmcounterlemma
  • proof
  • ...and 27 more