Gaussian Process Regression under Computational and Epistemic Misspecification

Daniel Sanz-Alonso; Ruiyi Yang

Gaussian Process Regression under Computational and Epistemic Misspecification

Daniel Sanz-Alonso, Ruiyi Yang

TL;DR

This paper develops a unified framework to analyze Gaussian process regression under simultaneous epistemic and computational misspecification, focusing on interpolation error when the employed kernel is a misspecified approximation of the true kernel. The authors bound the misspecified kriging error by decomposing it into a well-understood interpolation error plus an approximation error controlled via a bridge function $f_N$ in the misspecified RKHS, and a stability term. They instantiate the framework in four families of kernel misspecifications: Karhunen-Loève truncations, wavelet-based multiscale kernels, and finite element representations, along with a Matérn-parameter misspecification example, deriving explicit $L^2$ and $L^{\\infty}$ rates that depend on the fill distance $h_{n,\\Omega}$, the complexity parameter $N$, and regularity indices. The results recover known bounds in the noiseless Matérn setting and provide new rates and design guidelines for scalable GP interpolation under practical kernel-approximation strategies, informing how to balance approximation complexity against the number of observations. Together, these insights offer theoretical guidance for constructing scalable GP surrogates with provable accuracy in large-data settings and for choosing kernel-complexity targets to meet a desired error tolerance.

Abstract

Gaussian process regression is a classical kernel method for function estimation and data interpolation. In large data applications, computational costs can be reduced using low-rank or sparse approximations of the kernel. This paper investigates the effect of such kernel approximations on the interpolation error. We introduce a unified framework to analyze Gaussian process regression under important classes of computational misspecification: Karhunen-Loève expansions that result in low-rank kernel approximations, multiscale wavelet expansions that induce sparsity in the covariance matrix, and finite element representations that induce sparsity in the precision matrix. Our theory also accounts for epistemic misspecification in the choice of kernel parameters.

Gaussian Process Regression under Computational and Epistemic Misspecification

TL;DR

in the misspecified RKHS, and a stability term. They instantiate the framework in four families of kernel misspecifications: Karhunen-Loève truncations, wavelet-based multiscale kernels, and finite element representations, along with a Matérn-parameter misspecification example, deriving explicit

and

rates that depend on the fill distance

, the complexity parameter

, and regularity indices. The results recover known bounds in the noiseless Matérn setting and provide new rates and design guidelines for scalable GP interpolation under practical kernel-approximation strategies, informing how to balance approximation complexity against the number of observations. Together, these insights offer theoretical guidance for constructing scalable GP surrogates with provable accuracy in large-data settings and for choosing kernel-complexity targets to meet a desired error tolerance.

Abstract

Paper Structure (31 sections, 25 theorems, 136 equations, 1 table)

This paper contains 31 sections, 25 theorems, 136 equations, 1 table.

Introduction
Comparison to related work
Outline
Notation
Preliminaries
Problem formulation
Experimental region and design points
Unified theoretical framework
Main Result
Proof of the Main Result
(Part I)
(Part II)
Epistemic and computational misspecification
Epistemic misspecification in the Matérn model
Background
...and 16 more sections

Key Result

theorem 2

Let $\Phi_N:\mathcal{D}\times \mathcal{D}\rightarrow \mathbb{R}$ be a covariance function whose RKHS satisfies $\mathcal{H}_{\Phi_{\scaleto{N}{2.5pt}}} \subset H^{s_{\scaleto{N}{2.5pt}}} (\mathcal{D})$ for $s_N>\frac{d}{2}$ with equivalent norms: there exists $A_N>1$ such that $A_N^{-1}\|g\|_{\math If $\lambda=0$, the constant $C_{s_{\scaleto{0}{2.5pt}},s_{\scaleto{N}{2.5pt}},d,\Omega}$ can be im

Theorems & Definitions (52)

theorem 2
Remark 3
Remark 4
Remark 5
Remark 6
Lemma 7: arcangeli2012extension and narcowich2005sobolev
Lemma 8: wendland2005approximate
Lemma 9
Lemma 10
proof : Proof of Lemma \ref{['lemma:approx interpolation error']}
...and 42 more

Gaussian Process Regression under Computational and Epistemic Misspecification

TL;DR

Abstract

Gaussian Process Regression under Computational and Epistemic Misspecification

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (52)