A subspace-conjugate gradient method for linear matrix equations
Davide Palitta, Martina Iannacito, Valeria Simoncini
TL;DR
This work addresses solving large-scale multiterm linear matrix equations $\mathcal{L}(X)=C$ with symmetric positive definite $\mathcal{L}$ by introducing a preconditioned subspace-conjugate gradient method (ss-cg). Unlike standard matrix-oriented CG that updates along single-vector directions, ss-cg leverages wide subspaces generated by low-rank factor information and enforces $\mathcal{L}$-orthogonality over these subspaces, aided by a randomized range finder to control memory. The method extends to multiterm Sylvester equations and integrates memory-saving truncations, inexact coefficient solves, and preconditioning (including LR-ADI-based approaches) to handle very large problems. Numerical experiments on Lyapunov and Sylvester problems demonstrate strong convergence, reduced memory usage, and competitiveness against existing methods across diverse applications. The approach provides a scalable framework for large-scale matrix equations in control, stochastic PDEs, and related areas.
Abstract
The efficient solution of large-scale multiterm linear matrix equations is a challenging task in numerical linear algebra, and it is a largely open problem. We propose a new iterative scheme for symmetric and positive definite operators, significantly advancing methods such as truncated matrix-oriented Conjugate Gradients (CG). The new algorithm capitalizes on the low-rank matrix format of its iterates by fully exploiting the subspace information of the factors as iterations proceed. The approach implicitly relies on orthogonality conditions imposed over much larger subspaces than in CG, unveiling insightful connections with subspace projection methods. The new method is also equipped with memory-saving strategies. In particular, we show that for a given matrix $\mathbf{Y}$, the action $\mathcal{L}(\mathbf{Y})$ in low rank format may not be evaluated exactly due to memory constraints. This problem is often underestimated, though it will eventually produce Out-of-Memory breakdowns for a sufficiently large number of terms. We propose an ad-hoc randomized range-finding strategy that appears to fully resolve this shortcoming. Experimental results with typical application problems illustrate the potential of our approach over various methods developed in the recent literature.
