Random-sketching Techniques to Enhance the Numerical Stability of Block Orthogonalization Algorithms for s-step GMRES
Ichitaro Yamazaki, Andrew J. Higgins, Erik G. Boman, Daniel B. Szyld
TL;DR
The paper tackles numerical stability in block orthogonalization for $s$-step GMRES by integrating random sketching into the intra-block orthogonalization, ensuring $O(\epsilon)$ orthogonality when block vectors are numerically full-rank. It introduces RandCholQR and two-stage frameworks (with preprocessing via BCGS-PIP or RandBCGS2) to maintain well-conditioned bases while reducing communication. The authors provide theoretical bounds based on $\mu$-subspace embeddings, present implementation within Trilinos, and demonstrate through GPU-accelerated experiments on Perlmutter that the stabilization comes with modest overhead and scalable performance. Overall, the work delivers a portable, numerically robust approach to CA-Krylov methods with practical impact for large-scale GMRES in high-performance computing.
Abstract
We integrate random sketching techniques into block orthogonalization schemes needed for s-step GMRES. The resulting block orthogonalization schemes generate the basis vectors whose overall orthogonality error is bounded by machine precision as long as each of the corresponding block vectors are numerically full rank. We implement these randomized block orthogonalization schemes using standard distributed-memory linear algebra kernels for s-step GMRES available in the Trilinos software packages. Our performance results on the Perlmutter supercomputer (with four NVIDIA A100 GPUs per node) demonstrate that these randomized techniques can enhance the numerical stability of the orthogonalization and overall solver, without a significant increase in the execution time.
