Preconditioned Additive Gaussian Processes with Fourier Acceleration
Theresa Wagner, Tianshi Xu, Franziska Nestler, Yuanzhe Xi, Martin Stoll
TL;DR
This work tackles the computational bottlenecks of Gaussian processes by combining a matrix-free, NFFT-based acceleration for kernel-vector multiplications with an additive-kernel construction that reduces effective dimensionality. An AAFN preconditioner tailored to additive kernels speeds up hyperparameter optimization and stabilizes stochastic trace estimates, enabling scalable GP training on higher-dimensional data. The authors provide rigorous Fourier-approximation error bounds for Matérn kernels and show that the derivative computations used in learning are consistent with the Fourier approximations. Numerical experiments on synthetic and real data demonstrate comparable predictive performance and uncertainty quantification to exact methods, with substantial gains in efficiency and scalability.
Abstract
Gaussian processes (GPs) are crucial in machine learning for quantifying uncertainty in predictions. However, their associated covariance matrices, defined by kernel functions, are typically dense and large-scale, posing significant computational challenges. This paper introduces a matrix-free method that utilizes the Non-equispaced Fast Fourier Transform (NFFT) to achieve nearly linear complexity in the multiplication of kernel matrices and their derivatives with vectors for a predetermined accuracy level. To address high-dimensional problems, we propose an additive kernel approach. Each sub-kernel in this approach captures lower-order feature interactions, allowing for the efficient application of the NFFT method and potentially increasing accuracy across various real-world datasets. Additionally, we implement a preconditioning strategy that accelerates hyperparameter tuning, further improving the efficiency and effectiveness of GPs.
