Improving Iterative Gaussian Processes via Warm Starting Sequential Posteriors
Alan Yufei Dong, Jihao Andreas Lin, José Miguel Hernández-Lobato
TL;DR
This work tackles the scalability bottleneck of Gaussian process inference in sequential settings by warm-starting iterative linear solvers when new data are added. The authors prove that initializing from the previous solution reduces the initial distance to the final solution in the extended system, yielding substantial speed-ups across conjugate gradients, SGD, and alternating projections, and enabling more accurate posterior samples under compute constraints. Empirically, warm starting accelerates GP regression solves by up to roughly 6x for some solvers and improves Bayesian optimization performance in parallel Thompson sampling under limited compute budgets. Overall, the method enhances GP scalability for online learning and sequential decision-making without extra computation beyond storing prior solutions, with practical impact on active learning, online GP updates, and BO.
Abstract
Scalable Gaussian process (GP) inference is essential for sequential decision-making tasks, yet improving GP scalability remains a challenging problem with many open avenues of research. This paper focuses on iterative GPs, where iterative linear solvers, such as conjugate gradients, stochastic gradient descent or alternative projections, are used to approximate the GP posterior. We propose a new method which improves solver convergence of a large linear system by leveraging the known solution to a smaller system contained within. This is significant for tasks with incremental data additions, and we show that our technique achieves speed-ups when solving to tolerance, as well as improved Bayesian optimisation performance under a fixed compute budget.
