Learning to Discover Iterative Spectral Algorithms
Zihang Liu, Oleg Balabanov, Yaoqing Yang, Michael W. Mahoney
TL;DR
AutoSpec introduces a neural framework to discover spectrum-adaptive iterative algorithms for large-scale NLA and numerical optimization by predicting recurrence coefficients that define an executable matrix polynomial P(·). The approach uses coarse spectral probes and self-supervised task objectives, training on synthetic diagonal operators with transfer to real-world sparse matrices, and it yields learned recurrences that improve convergence and accuracy over baselines while exhibiting Chebyshev-like minimax properties. Two recurrence paradigms are explored: an affine three-term recurrence and a basis-generation followed by learned expansion, both parameterized by a neural engine and implemented in a matrix-free, low-storage fashion. Across eigenvalue problems, linear systems, and matrix-function tasks, AutoSpec achieves orders-of-magnitude improvements in iteration counts or operator-norm residuals, demonstrating robust performance under limited spectral information and suggesting a practical path to automated, state-of-the-art NLA methods.
Abstract
We introduce AutoSpec, a neural network framework for discovering iterative spectral algorithms for large-scale numerical linear algebra and numerical optimization. Our self-supervised models adapt to input operators using coarse spectral information (e.g., eigenvalue estimates and residual norms), and they predict recurrence coefficients for computing or applying a matrix polynomial tailored to a downstream task. The effectiveness of AutoSpec relies on three ingredients: an architecture whose inference pass implements short, executable numerical linear algebra recurrences; efficient training on small synthetic problems with transfer to large-scale real-world operators; and task-defined objectives that enforce the desired approximation or preconditioning behavior across the range of spectral profiles represented in the training set. We apply AutoSpec to discovering algorithms for representative numerical linear algebra tasks: accelerating matrix-function approximation; accelerating sparse linear solvers; and spectral filtering/preconditioning for eigenvalue computations. On real-world matrices, the learned procedures deliver orders-of-magnitude improvements in accuracy and/or reductions in iteration count, relative to basic baselines. We also find clear connections to classical theory: the induced polynomials often exhibit near-equiripple, near-minimax behavior characteristic of Chebyshev polynomials.
