Sketched and Truncated Polynomial Krylov Subspace Methods: Matrix Sylvester Equations
Davide Palitta, Marcel Schweitzer, Valeria Simoncini
TL;DR
The paper tackles large-scale Sylvester equations $\mathbf{A}\mathbf{X}+\mathbf{X}\mathbf{B}=\mathbf{C}_1\mathbf{C}_2^{\top}$ by introducing a sketched-and-truncated polynomial Krylov approach that leverages oblivious subspace embeddings to reduce orthogonalization cost and memory. Central ideas include WS-Arnoldi whitening, a reduced Galerkin projection for $\mathbf{Y}_d$, a two-pass reconstruction to recover the full low-rank solution, and a theoretical framework based on effective fields of values to ensure well-posedness and convergence. Theoretical results cover field-of-values distortions, Lyapunov-convergence bounds under sketching, and bounds on the distance to full Arnoldi approximations. Numerical experiments demonstrate substantial memory and time savings, competitive performance against RKSM, and robustness to non-orthogonal bases, confirming the practical viability of sketched-and-truncated Krylov projections for large matrix equations.
Abstract
Thanks to its great potential in reducing both computational cost and memory requirements, combining sketching and Krylov subspace techniques has attracted a lot of attention in the recent literature on projection methods for linear systems, matrix function approximations, and eigenvalue problems. Applying this appealing strategy in the context of linear matrix equations turns out to be far more involved than a straightforward generalization. These difficulties include analyzing well-posedness of the projected problem and deriving possible error estimates depending on the sketching properties. Further computational complications include the lack of a natural residual norm estimate and of an explicit basis for the generated subspace. In this paper we propose a new sketched-and-truncated polynomial Krylov subspace method for Sylvester equations that aims to address all these issues. The potential of our novel approach, in terms of both computational time and storage demand, is illustrated with numerical experiments. Comparisons with a state-of-the-art projection scheme based on rational Krylov subspaces are also included.
