Table of Contents
Fetching ...

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

Ivan Bioli, Daniel Kressner, Leonardo Robol

TL;DR

This work addresses the numerical solution of large-scale multiterm symmetric positive definite matrix equations of the form $A_1 X B_1^\top + \cdots + A_\ell X B_\ell^\top = F$. It proposes a direct low-rank approach on the fixed-rank matrix manifold $\mathcal{M}_r$ using a preconditioned Riemannian Nonlinear Conjugate Gradient method, augmented by various preconditioning strategies and a tangential ADI (tangADI) scheme to approximate inverse actions efficiently. A rank-adaptive framework (RRAM) is developed to balance accuracy and rank growth, combining fixed-rank Riemannian optimization with strategic rank updates. Numerical tests on 2D PDE discretizations, stochastic Galerkin problems, and a modified Rail Lyapunov-type problem demonstrate that the proposed methods—particularly when paired with tangADI and rank adaptivity—achieve competitive wall-clock times and scalable performance compared to iterate-and-truncate approaches and truncated CG. The results highlight the practical impact of structure-exploiting, preconditioned Riemannian optimization for large multiterm matrix equations in PDEs and control applications.

Abstract

This work is concerned with the numerical solution of large-scale symmetric positive definite matrix equations of the form $A_1XB_1^\top + A_2XB_2^\top + \dots + A_\ell X B_\ell^\top = F$, as they arise from discretized partial differential equations and control problems. One often finds that $X$ admits good low-rank approximations, in particular when the right-hand side matrix $F$ has low rank. For $\ell \le 2$ terms, the solution of such equations is well studied and effective low-rank solvers have been proposed, including Alternating Direction Implicit (ADI) methods for Lyapunov and Sylvester equations. For $\ell > 2$, several existing methods try to approach $X$ through combining a classical iterative method, such as the conjugate gradient (CG) method, with low-rank truncation. In this work, we consider a more direct approach that approximates $X$ on manifolds of fixed-rank matrices through Riemannian CG. One particular challenge is the incorporation of effective preconditioners into such a first-order Riemannian optimization method. We propose several novel preconditioning strategies, including a change of metric in the ambient space, preconditioning the Riemannian gradient, and a variant of ADI on the tangent space. Combined with a strategy for adapting the rank of the approximation, the resulting method is demonstrated to be competitive for a number of examples representative for typical applications.

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

TL;DR

This work addresses the numerical solution of large-scale multiterm symmetric positive definite matrix equations of the form . It proposes a direct low-rank approach on the fixed-rank matrix manifold using a preconditioned Riemannian Nonlinear Conjugate Gradient method, augmented by various preconditioning strategies and a tangential ADI (tangADI) scheme to approximate inverse actions efficiently. A rank-adaptive framework (RRAM) is developed to balance accuracy and rank growth, combining fixed-rank Riemannian optimization with strategic rank updates. Numerical tests on 2D PDE discretizations, stochastic Galerkin problems, and a modified Rail Lyapunov-type problem demonstrate that the proposed methods—particularly when paired with tangADI and rank adaptivity—achieve competitive wall-clock times and scalable performance compared to iterate-and-truncate approaches and truncated CG. The results highlight the practical impact of structure-exploiting, preconditioned Riemannian optimization for large multiterm matrix equations in PDEs and control applications.

Abstract

This work is concerned with the numerical solution of large-scale symmetric positive definite matrix equations of the form , as they arise from discretized partial differential equations and control problems. One often finds that admits good low-rank approximations, in particular when the right-hand side matrix has low rank. For terms, the solution of such equations is well studied and effective low-rank solvers have been proposed, including Alternating Direction Implicit (ADI) methods for Lyapunov and Sylvester equations. For , several existing methods try to approach through combining a classical iterative method, such as the conjugate gradient (CG) method, with low-rank truncation. In this work, we consider a more direct approach that approximates on manifolds of fixed-rank matrices through Riemannian CG. One particular challenge is the incorporation of effective preconditioners into such a first-order Riemannian optimization method. We propose several novel preconditioning strategies, including a change of metric in the ambient space, preconditioning the Riemannian gradient, and a variant of ADI on the tangent space. Combined with a strategy for adapting the rank of the approximation, the resulting method is demonstrated to be competitive for a number of examples representative for typical applications.
Paper Structure (22 sections, 2 theorems, 71 equations, 4 figures, 2 algorithms)

This paper contains 22 sections, 2 theorems, 71 equations, 4 figures, 2 algorithms.

Key Result

Proposition 3.1

Let $E\in{\mathbb{R}^{m\times m}}$, $D\in{\mathbb{R}^{n\times n}}$ be SPD. Given $Z \in {\mathbb{R}^{m\times n}}$ there exists a decomposition $Z = \tilde{U}\tilde{\Sigma} \tilde{V}^\top$ called weighted SVD such that $\tilde{U}\in{\mathbb{R}^{m\times m}}$ is $E$-orthogonal ($\tilde{U}^\top E \tild

Figures (4)

  • Figure 1: Discretized two-dimensional PDE from \ref{['sec:2dpde']} with $m = n = 10\,000$. Comparison of R-NLCG with fixed rank $r = 12$ and with rank adaptivity (RRAM) as well as truncated CG with two different low-rank truncation strategies. From left to right: relative residual vs. iterations, relative residual vs. time, and rank of approximate solution vs. iterations.
  • Figure 2: Stochastic Galerkin matrix equation from \ref{['sec:stochasticgalerkin']} for test problem 5 from S-IFISS ($m= 16\,129$, $n = 2\,002$, $\ell = 10$). Comparison of R-NLCG with fixed rank $r = 55$ and with rank adaptivity (RRAM) as well as truncated CG and MultiRB.
  • Figure 3: Stochastic Galerkin matrix equation from \ref{['sec:stochasticgalerkin']} for test problem 2 from S-IFISS ($m= 16\,129$, $n = 1\,287$, $\ell = 9$). Comparison of R-NLCG with fixed rank $r = 180$ and with rank adaptivity (RRAM) as well as truncated CG and MultiRB.
  • Figure 4: Modified Bilinear Rail problem ($n = 5\,177$) from \ref{['sec:rail']}. Comparison of R-NLCG on manifold of low-rank symmetric positive semidefinite matrices with fixed rank $r = 150$ and with rank adaptivity (RRAM) as well as truncated CG.

Theorems & Definitions (6)

  • Proposition 3.1
  • proof
  • Proposition 3.2
  • proof
  • Remark 3.3
  • Remark 3.4