Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

Ivan Bioli; Daniel Kressner; Leonardo Robol

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

Ivan Bioli, Daniel Kressner, Leonardo Robol

TL;DR

This work addresses the numerical solution of large-scale multiterm symmetric positive definite matrix equations of the form $A_1 X B_1^\top + \cdots + A_\ell X B_\ell^\top = F$. It proposes a direct low-rank approach on the fixed-rank matrix manifold $\mathcal{M}_r$ using a preconditioned Riemannian Nonlinear Conjugate Gradient method, augmented by various preconditioning strategies and a tangential ADI (tangADI) scheme to approximate inverse actions efficiently. A rank-adaptive framework (RRAM) is developed to balance accuracy and rank growth, combining fixed-rank Riemannian optimization with strategic rank updates. Numerical tests on 2D PDE discretizations, stochastic Galerkin problems, and a modified Rail Lyapunov-type problem demonstrate that the proposed methods—particularly when paired with tangADI and rank adaptivity—achieve competitive wall-clock times and scalable performance compared to iterate-and-truncate approaches and truncated CG. The results highlight the practical impact of structure-exploiting, preconditioned Riemannian optimization for large multiterm matrix equations in PDEs and control applications.

Abstract

This work is concerned with the numerical solution of large-scale symmetric positive definite matrix equations of the form $A_1XB_1^\top + A_2XB_2^\top + \dots + A_\ell X B_\ell^\top = F$, as they arise from discretized partial differential equations and control problems. One often finds that $X$ admits good low-rank approximations, in particular when the right-hand side matrix $F$ has low rank. For $\ell \le 2$ terms, the solution of such equations is well studied and effective low-rank solvers have been proposed, including Alternating Direction Implicit (ADI) methods for Lyapunov and Sylvester equations. For $\ell > 2$, several existing methods try to approach $X$ through combining a classical iterative method, such as the conjugate gradient (CG) method, with low-rank truncation. In this work, we consider a more direct approach that approximates $X$ on manifolds of fixed-rank matrices through Riemannian CG. One particular challenge is the incorporation of effective preconditioners into such a first-order Riemannian optimization method. We propose several novel preconditioning strategies, including a change of metric in the ambient space, preconditioning the Riemannian gradient, and a variant of ADI on the tangent space. Combined with a strategy for adapting the rank of the approximation, the resulting method is demonstrated to be competitive for a number of examples representative for typical applications.

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

TL;DR

This work addresses the numerical solution of large-scale multiterm symmetric positive definite matrix equations of the form

. It proposes a direct low-rank approach on the fixed-rank matrix manifold

using a preconditioned Riemannian Nonlinear Conjugate Gradient method, augmented by various preconditioning strategies and a tangential ADI (tangADI) scheme to approximate inverse actions efficiently. A rank-adaptive framework (RRAM) is developed to balance accuracy and rank growth, combining fixed-rank Riemannian optimization with strategic rank updates. Numerical tests on 2D PDE discretizations, stochastic Galerkin problems, and a modified Rail Lyapunov-type problem demonstrate that the proposed methods—particularly when paired with tangADI and rank adaptivity—achieve competitive wall-clock times and scalable performance compared to iterate-and-truncate approaches and truncated CG. The results highlight the practical impact of structure-exploiting, preconditioned Riemannian optimization for large multiterm matrix equations in PDEs and control applications.

Abstract

This work is concerned with the numerical solution of large-scale symmetric positive definite matrix equations of the form

, as they arise from discretized partial differential equations and control problems. One often finds that

admits good low-rank approximations, in particular when the right-hand side matrix

has low rank. For

terms, the solution of such equations is well studied and effective low-rank solvers have been proposed, including Alternating Direction Implicit (ADI) methods for Lyapunov and Sylvester equations. For

, several existing methods try to approach

through combining a classical iterative method, such as the conjugate gradient (CG) method, with low-rank truncation. In this work, we consider a more direct approach that approximates

on manifolds of fixed-rank matrices through Riemannian CG. One particular challenge is the incorporation of effective preconditioners into such a first-order Riemannian optimization method. We propose several novel preconditioning strategies, including a change of metric in the ambient space, preconditioning the Riemannian gradient, and a variant of ADI on the tangent space. Combined with a strategy for adapting the rank of the approximation, the resulting method is demonstrated to be competitive for a number of examples representative for typical applications.

Paper Structure (22 sections, 2 theorems, 71 equations, 4 figures, 2 algorithms)

This paper contains 22 sections, 2 theorems, 71 equations, 4 figures, 2 algorithms.

Introduction
Brief overview of low-rank Riemannian optimization
Embedded geometry of fixed-rank matrices
Riemannian Nonlinear Conjugate Gradient (R-NLCG)
Preconditioned Riemannian Optimization
Riemannian preconditioning for multiterm matrix equations
Preconditioned inner product with BX = EXD
Preconditioned gradient with PX = EXD
Preconditioned gradient with PX = AX+XB
Preconditioning with PX = AXD + EXB
tangADI: ADI on the tangent space
Classical ADI
Basic form of tangADI
Implementation aspects and complexity
Preconditioned R-NLCG with rank adaptivity
...and 7 more sections

Key Result

Proposition 3.1

Let $E\in{\mathbb{R}^{m\times m}}$, $D\in{\mathbb{R}^{n\times n}}$ be SPD. Given $Z \in {\mathbb{R}^{m\times n}}$ there exists a decomposition $Z = \tilde{U}\tilde{\Sigma} \tilde{V}^\top$ called weighted SVD such that $\tilde{U}\in{\mathbb{R}^{m\times m}}$ is $E$-orthogonal ($\tilde{U}^\top E \tild

Figures (4)

Figure 1: Discretized two-dimensional PDE from \ref{['sec:2dpde']} with $m = n = 10\,000$. Comparison of R-NLCG with fixed rank $r = 12$ and with rank adaptivity (RRAM) as well as truncated CG with two different low-rank truncation strategies. From left to right: relative residual vs. iterations, relative residual vs. time, and rank of approximate solution vs. iterations.
Figure 2: Stochastic Galerkin matrix equation from \ref{['sec:stochasticgalerkin']} for test problem 5 from S-IFISS ($m= 16\,129$, $n = 2\,002$, $\ell = 10$). Comparison of R-NLCG with fixed rank $r = 55$ and with rank adaptivity (RRAM) as well as truncated CG and MultiRB.
Figure 3: Stochastic Galerkin matrix equation from \ref{['sec:stochasticgalerkin']} for test problem 2 from S-IFISS ($m= 16\,129$, $n = 1\,287$, $\ell = 9$). Comparison of R-NLCG with fixed rank $r = 180$ and with rank adaptivity (RRAM) as well as truncated CG and MultiRB.
Figure 4: Modified Bilinear Rail problem ($n = 5\,177$) from \ref{['sec:rail']}. Comparison of R-NLCG on manifold of low-rank symmetric positive semidefinite matrices with fixed rank $r = 150$ and with rank adaptivity (RRAM) as well as truncated CG.

Theorems & Definitions (6)

Proposition 3.1
proof
Proposition 3.2
proof
Remark 3.3
Remark 3.4

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

TL;DR

Abstract

Preconditioned Low-Rank Riemannian Optimization for Symmetric Positive Definite Linear Matrix Equations

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (6)