Lengthscale-informed sparse grids for kernel methods in high dimensions
Elliot J. Addy, Jonas Latz, Aretha L. Teckentrup
TL;DR
The work introduces lengthscale-informed sparse grids (LISGs) to overcome the curse of dimensionality in kernel interpolation and Gaussian process emulation by embedding axis-wise lengthscale anisotropy into the sparse-grid design. A novel LISG construction and its associated operator $P_{L,\boldsymbol{\nu},\mathbf{p}}$ yield dimension-robust error bounds in the native space of separable Matérn kernels, with explicit counts of evaluation points $N_{d,\mathbf{p}}(L)$. The analysis derives predictive variance bounds for GP emulation and provides a fast, scalable implementation based on the sparse-grid combination technique, enabling experiments up to $d=100$. Numerical results show superior accuracy-efficiency of LISGs compared with isotropic sparse grids and Monte Carlo sampling in highly anisotropic settings, without requiring anisotropic regularity of the target function. The framework handles high-dimensional problems by exploiting lengthscale anisotropy through a penalty vector, offering dimension-robust performance and practical applicability in GP surrogate modelling.
Abstract
Kernel interpolation, especially in the context of Gaussian process emulation, is a widely used technique in surrogate modelling, where the goal is to cheaply approximate an input-output map using a limited number of function evaluations. However, in high-dimensional settings, such methods typically suffer from the curse of dimensionality; the number of required evaluations to achieve a fixed approximation error grows exponentially with the input dimension. To overcome this, a common technique used in high-dimensional approximation methods, such as quasi-Monte Carlo and sparse grids, is to exploit functional anisotropy: the idea that some input dimensions are more 'sensitive' than others. In doing so, such methods can significantly reduce the dimension dependence in the error. In this work, we propose a generalisation of sparse grid methods that incorporates a form of anisotropy encoded by the lengthscale parameter in Matérn kernels. We derive error bounds and perform numerical experiments that show that our approach enables effective emulation over arbitrarily high dimensions for functions exhibiting sufficient anisotropy.
