Graph Neural Preconditioners for Iterative Solutions of Sparse Linear Systems
Jie Chen
TL;DR
This paper introduces Graph Neural Preconditioners (GNPs) for accelerating iterative solutions of large sparse linear systems by learning a nonlinear approximation of the matrix inverse, $M \approx A^{-1}$, with a scale-equivariant graph neural network. Because $M$ is nonlinear, it is integrated into Flexible GMRES (FGMRES), and the paper provides a convergence analysis for this setting, a training-data generation strategy that targets the bottom spectrum, and a broad SuiteSparse-based evaluation on over 800 matrices showing strong robustness, predictable construction times, and competitive execution times relative to ILU, AMG, and GMRES used as a preconditioner. Key contributions include the convergence analysis for FGMRES with nonlinear preconditioning, a practical training-data strategy, and a scale-equivariant GNN design that remains effective across diverse matrices. The results suggest that GNPs offer a viable, robust alternative for general-purpose preconditioning, particularly in domains that produce ill-conditioned sparse systems.
Abstract
Preconditioning is at the heart of iterative solutions of large, sparse linear systems of equations in scientific disciplines. Several algebraic approaches, which access no information beyond the matrix itself, are widely studied and used, but ill-conditioned matrices remain very challenging. We take a machine learning approach and propose using graph neural networks as a general-purpose preconditioner. They show attractive performance for many problems and can be used when the mainstream preconditioners perform poorly. Empirical evaluation on over 800 matrices suggests that the construction time of these graph neural preconditioners (GNPs) is more predictable and can be much shorter than that of other widely used ones, such as ILU and AMG, while the execution time is shorter than that of using a Krylov method as the preconditioner, such as in inner-outer GMRES. GNPs have a strong potential for solving large-scale, challenging algebraic problems arising not only from partial differential equations, but also from economics, statistics, graphs, and optimization, to name a few.
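To make the role of Flexible GMRES concrete, the following is a minimal NumPy sketch of textbook right-preconditioned FGMRES in which the preconditioner is an arbitrary callable, so it may be nonlinear. It is not the paper's implementation. The names `fgmres` and `make_toy_preconditioner` are hypothetical, and the toy preconditioner (a few minimal-residual steps) merely stands in for a trained GNP: like the network described in the summary, it is nonlinear in its input and scale-equivariant, but it involves no learning.

```python
import numpy as np


def fgmres(A, b, M, restart=30, tol=1e-8, max_restarts=50):
    """Restarted Flexible GMRES; M is a callable v -> z with z ~ A^{-1} v."""
    n = b.shape[0]
    x = np.zeros(n)
    b_norm = np.linalg.norm(b)
    if b_norm == 0.0:
        return x, 0.0
    for _ in range(max_restarts):
        r = b - A @ x
        beta = np.linalg.norm(r)
        if beta / b_norm <= tol:
            break
        V = np.zeros((n, restart + 1))   # orthonormal Krylov-like basis
        Z = np.zeros((n, restart))       # preconditioned vectors, stored explicitly
        H = np.zeros((restart + 1, restart))
        V[:, 0] = r / beta
        k = 0
        for j in range(restart):
            # M may change from step to step, so z_j must be kept.
            Z[:, j] = M(V[:, j])
            w = A @ Z[:, j]
            # Modified Gram-Schmidt against the existing basis.
            for i in range(j + 1):
                H[i, j] = V[:, i] @ w
                w = w - H[i, j] * V[:, i]
            H[j + 1, j] = np.linalg.norm(w)
            k = j + 1
            if H[j + 1, j] < 1e-14:      # breakdown: stop this cycle
                break
            V[:, j + 1] = w / H[j + 1, j]
        # Solve the small least-squares problem min ||beta*e1 - H y||.
        e1 = np.zeros(k + 1)
        e1[0] = beta
        y, *_ = np.linalg.lstsq(H[: k + 1, :k], e1, rcond=None)
        # The update uses Z (not V), which is what makes the method "flexible".
        x = x + Z[:, :k] @ y
    rel_res = np.linalg.norm(b - A @ x) / b_norm
    return x, rel_res


def make_toy_preconditioner(A, steps=5):
    """Hypothetical stand-in for a GNP: a few minimal-residual iterations."""
    def M(v):
        z = np.zeros_like(v)
        r = v.copy()
        for _ in range(steps):
            Ar = A @ r
            denom = Ar @ Ar
            if denom == 0.0:
                break
            alpha = (Ar @ r) / denom     # step size depends on v: nonlinear map
            z = z + alpha * r
            r = r - alpha * Ar
        return z
    return M


# Small, well-conditioned tridiagonal test problem (an assumption for the demo).
n = 50
A = 2.5 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
b = np.ones(n)
M = make_toy_preconditioner(A)
x, rel_res = fgmres(A, b, M)
print("relative residual:", rel_res)
```

Because the solution update is formed from the stored vectors `Z[:, j] = M(V[:, j])` rather than from a fixed linear operator applied to the basis, the same loop works unchanged if `M` is replaced by a neural network forward pass.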
