Optimal Krylov On Average
Qi Luo, Florian Schäfer
TL;DR
This paper tackles the cost of solving linear systems inside Krylov-based inner loops by introducing an adaptive randomized truncation estimator (the AS estimator) that preserves unbiasedness while optimizing the trade-off between variance and computation. By formulating a constrained optimization over truncation probabilities $\mathbb{P}(j)$, it achieves closed-form solutions under a diminishing returns property for methods like CG/CR, with a generalized, unbiased alternative when the property fails. The AS-CG variant is demonstrated to outperform existing RR-CG approaches in speed-variance trade-offs and shows robust performance in GP hyperparameter optimization and competitive physics-informed neural networks. These results indicate that adaptive truncation can substantially improve reliability and efficiency of Krylov solvers in large-scale, data-driven scientific computing tasks.
Abstract
We propose an adaptive randomized truncation estimator for Krylov subspace methods that optimizes the trade-off between the solution variance and the computational cost, while remaining unbiased. The estimator solves a constrained optimization problem to compute the truncation probabilities on the fly, with minimal computational overhead. The problem has a closed-form solution when the improvement of the deterministic algorithm satisfies a diminishing returns property. We prove that obtaining the optimal adaptive truncation distribution is impossible in the general case. Without the diminishing return condition, our estimator provides a suboptimal but still unbiased solution. We present experimental results in GP hyperparameter training and competitive physics-informed neural networks problem to demonstrate the effectiveness of our approach.
