Accelerating multigrid with streaming chiral SVD for Wilson fermions in lattice QCD
Travis Whyte, Andreas Stathopoulos, Eloy Romero
TL;DR
The paper tackles critical slowing-down in lattice QCD solvers for Wilson fermions by enlarging the multigrid test-vector basis and then truncating it with a chiral singular value decomposition (CSVD) to form efficient prolongation/restriction operators, $\bm{P}$ and $\bm{R}$. A streaming variant, iCSVD, mitigates storage costs by updating left singular vectors incrementally across streams while preserving near-null-space quality. Numerical experiments on anisotropic and isotropic lattices ($m_\pi \approx 239$ MeV and $m_\pi \approx 220$ MeV) show consistent speedups, with volume scaling tests indicating robust performance across larger volumes. At near-critical mass $m_q \approx m_{crit}$, iCSVD achieves substantial improvements, including a speedup of about 1.7× (approximately 170%) over regular multigrid, and the approach is positioned for integration with least-squares interpolation in future work.
Abstract
A modification to the setup algorithm for the multigrid preconditioner of Wilson fermions in lattice QCD is presented. A larger basis of test vectors than that used in conventional multigrid is calculated by the smoother and truncated by singular value decomposition on the chiral components of the test vectors. The truncated basis is used to form the prolongation and restriction matrices of the multigrid hierarchy. This modification of the setup method is demonstrated to increase the convergence of linear solvers on an anisotropic lattice with $m_π \approx 239$ MeV from the Hadron Spectrum Collaboration and an isotropic lattice with $m_π \approx 220$ MeV from the MILC Collaboration. The lattice volume dependence of the method is also examined. Increasing the number of test vectors improves speedup up to a point, but storing these vectors becomes impossible in limited memory resources such as GPUs. To address storage cost, we implement a \emph{streaming} singular value decomposition of the basis of test vectors on the chiral components and demonstrate a decrease in the number of fine level iterations by a factor of 1.7 for $m_q \approx m_{crit}$.
