Iterative Methods for Full-Scale Gaussian Process Approximations for Large Spatial Data
Tim Gyger, Reinhard Furrer, Fabio Sigrist
TL;DR
This work tackles the scalability of Gaussian process inference for large spatial data by applying fast iterative solvers to full-scale approximations (FSAs), which combine predictive process methods with covariance tapering. It introduces a novel fully independent training conditional (FITC) preconditioner that accelerates conjugate gradient convergence and reduces sensitivity to the FSA parameters. Log-determinants and gradients are computed with stochastic estimators, and a fast simulation-based method yields accurate predictive variances. The authors provide theoretical convergence guarantees and support them with extensive simulations, which show that the iterative approach matches Cholesky-based accuracy with substantial speedups; they also extend the FITC preconditioner to Vecchia approximations. In a real-world study on MODIS Terra data, iterative inference delivers results comparable to exact methods while running roughly an order of magnitude faster. The software is released as an open-source C++ library with Python and R interfaces.
Abstract
Gaussian processes are flexible probabilistic regression models which are widely used in statistics and machine learning. However, a drawback is their limited scalability to large data sets. To alleviate this, full-scale approximations (FSAs) combine predictive process methods and covariance tapering, thus approximating both global and local structures. We show how iterative methods can be used to reduce computational costs in calculating likelihoods, gradients, and predictive distributions with FSAs. In particular, we introduce a novel preconditioner and show theoretically and empirically that it accelerates the conjugate gradient method's convergence speed and mitigates its sensitivity with respect to the FSA parameters and the eigenvalue structure of the original covariance matrix, and we demonstrate empirically that it outperforms a state-of-the-art pivoted Cholesky preconditioner. Furthermore, we introduce an accurate and fast way to calculate predictive variances using stochastic simulation and iterative methods. In addition, we show how our newly proposed fully independent training conditional (FITC) preconditioner can also be used in iterative methods for Vecchia approximations. In our experiments, it outperforms existing state-of-the-art preconditioners for Vecchia approximations. All methods are implemented in a free C++ software library with high-level Python and R packages.
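To make the role of the preconditioner concrete, the following is a minimal, illustrative sketch in dense NumPy, not the authors' C++ implementation. It assumes an exponential covariance, a Wendland taper, randomly chosen inducing points, and small problem sizes, all picked only for illustration. It also assumes that the FITC preconditioner has the form "low-rank predictive process part plus the diagonal of the residual and noise", applied through the Woodbury identity; all function and variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def pairwise_dist(X1, X2):
    """Euclidean distances between the rows of X1 and the rows of X2."""
    return np.linalg.norm(X1[:, None, :] - X2[None, :, :], axis=-1)

def exp_cov(D, variance=1.0, length_scale=0.2):
    """Exponential covariance (an assumed choice for this sketch)."""
    return variance * np.exp(-D / length_scale)

def wendland_taper(D, taper_range):
    """Compactly supported Wendland taper, exactly zero beyond taper_range."""
    u = np.clip(D / taper_range, 0.0, 1.0)
    return (1.0 - u) ** 4 * (1.0 + 4.0 * u)

# --- Assumed toy setup: n locations, m inducing points -------------------
n, m = 400, 40
noise_var, taper_range = 0.05, 0.1
X = rng.uniform(size=(n, 2))
Z = X[rng.choice(n, size=m, replace=False)]

D_nn = pairwise_dist(X, X)
K_nn = exp_cov(D_nn)
K_nm = exp_cov(pairwise_dist(X, Z))
K_mm = exp_cov(pairwise_dist(Z, Z)) + 1e-10 * np.eye(m)  # small jitter

# Predictive process part Q = K_nm K_mm^{-1} K_mn, written as V^T V.
L_mm = np.linalg.cholesky(K_mm)
V = np.linalg.solve(L_mm, K_nm.T)  # shape (m, n)
Q = V.T @ V

# FSA covariance: low-rank part + tapered residual + noise.
A = Q + (K_nn - Q) * wendland_taper(D_nn, taper_range) + noise_var * np.eye(n)

# --- FITC-type preconditioner P = diag(residual) + noise + Q -------------
d = np.diag(K_nn) - np.sum(V ** 2, axis=0) + noise_var
M = np.eye(m) + (V / d) @ V.T  # m x m Woodbury core: I + V D^{-1} V^T
L_M = np.linalg.cholesky(M)

def apply_fitc_inverse(r):
    """Compute P^{-1} r with the Woodbury identity, O(n m) per call."""
    Dr = r / d
    t = np.linalg.solve(L_M.T, np.linalg.solve(L_M, V @ Dr))
    return Dr - (V.T @ t) / d

def pcg(matvec, b, precond=None, tol=1e-8, max_iter=2000):
    """Preconditioned conjugate gradient; returns (solution, iterations)."""
    x = np.zeros_like(b)
    r = b.copy()
    z = precond(r) if precond is not None else r.copy()
    p = z.copy()
    rz = r @ z
    b_norm = np.linalg.norm(b)
    for it in range(1, max_iter + 1):
        Ap = matvec(p)
        alpha = rz / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        if np.linalg.norm(r) <= tol * b_norm:
            return x, it
        z = precond(r) if precond is not None else r
        rz_new = r @ z
        p = z + (rz_new / rz) * p
        rz = rz_new
    return x, max_iter

b = rng.standard_normal(n)
x_direct = np.linalg.solve(A, b)
x_plain, it_plain = pcg(lambda v: A @ v, b)
x_pre, it_pre = pcg(lambda v: A @ v, b, precond=apply_fitc_inverse)
print("CG iterations without preconditioner:", it_plain)
print("CG iterations with FITC-type preconditioner:", it_pre)
```

In this sketch the matrix `A` is stored densely for simplicity. In a large-scale setting the tapered residual would be kept sparse and the low-rank part would be applied through `V`, so that each matrix-vector product avoids the O(n^2) cost.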
