Mixed-precision algorithms for solving the Sylvester matrix equation
Andrii Dmytryshyn, Massimiliano Fasi, Nicholas J. Higham, Xiaobo Liu
TL;DR
This work develops two mixed-precision strategies for solving the Sylvester equation $A X + X B = C$ by computing Schur factors in a low-precision format and refining to high precision. The methods rely on an iterative refinement core for perturbed quasi-triangular Sylvester equations, and stabilize the solution by either orthonormalizing unitary factors in high precision or by explicitly inverting near-unitary factors. The authors provide rigorous error and cost analyses, including a flop-based model to compare against the traditional Bartels–Stewart algorithm, and demonstrate through extensive numerical experiments that the new approaches achieve accuracy comparable to high-precision methods and can offer speedups on hardware with native low-precision capabilities. The results highlight practical guidelines for selecting precision combinations and refinement counts, and suggest promising directions for extending the framework to additional precision tiers and related matrix equations.
Abstract
We consider the solution of the Sylvester equation $AX+XB=C$ in mixed precision. We derive a new iterative refinement scheme to solve perturbed quasi-triangular Sylvester equations; our rounding error analysis provides sufficient conditions for convergence and a bound on the attainable relative residual. We leverage this iterative scheme to solve the general Sylvester equation. The new algorithms compute the Schur decomposition of the coefficient matrices $A$ and $B$ in lower than working precision, use the low-precision Schur factors to obtain an approximate solution to the perturbed quasi-triangular equation, and iteratively refine it to obtain a working-precision solution. In order to solve the original equation to working precision, the unitary Schur factors of the coefficient matrices must be unitary to working precision, but this is not the case if the Schur decomposition is computed in low precision. We propose two effective approaches to address this: one is based on re-orthonormalization in working precision, and the other on explicit inversion of the almost-unitary factors. The two mixed-precision algorithms thus obtained are tested on various Sylvester and Lyapunov equations from the literature. Our numerical experiments show that, for both types of equations, the new algorithms are at least as accurate as existing ones. Our cost analysis, on the other hand, suggests that they would typically be faster than mono-precision alternatives if implemented on hardware that natively supports low precision.
