Table of Contents
Fetching ...

The phase diagram of kernel interpolation in large dimensions

Haobo Zhang, Weihao Lu, Qian Lin

TL;DR

This work analyzes kernel interpolation in the high-dimensional regime where $n \asymp d^{\gamma}$, focusing on the inner product kernel on the sphere. By exploiting the explicit spherical-harmonic eigenstructure and the RKHS interpolation spaces $[\mathcal{H}]^{s}$, it derives exact upper and lower bounds for the variance and bias of the minimum-norm interpolant and translates them into a complete $(s,\gamma)$-phase diagram. The variance scales as $\sigma^{2} \Theta_{\mathbb P}(d^{l-\gamma} + d^{\gamma-l-1})$ and the bias as $\Theta_{\mathbb P}(d^{-(l+1)s} + d^{(2-\tilde s)l-2\gamma})$, with $l=\lfloor \gamma \rfloor$ and $\tilde s = \min\{s,2\}$, yielding a total generalization error that underpins regions of minimax optimality, sub-optimality, and inconsistency. A sharp minimax lower bound aligns with the upper bounds, establishing the first complete phase diagram for kernel interpolation in large dimensions and clarifying when benign overfitting can occur. The results have implications for understanding kernel-based generalization in highly parameterized regimes and guide design choices in high-dimensional learning settings.

Abstract

The generalization ability of kernel interpolation in large dimensions (i.e., $n \asymp d^γ$ for some $γ>0$) might be one of the most interesting problems in the recent renaissance of kernel regression, since it may help us understand the 'benign overfitting phenomenon' reported in the neural networks literature. Focusing on the inner product kernel on the sphere, we fully characterized the exact order of both the variance and bias of large-dimensional kernel interpolation under various source conditions $s\geq 0$. Consequently, we obtained the $(s,γ)$-phase diagram of large-dimensional kernel interpolation, i.e., we determined the regions in $(s,γ)$-plane where the kernel interpolation is minimax optimal, sub-optimal and inconsistent.

The phase diagram of kernel interpolation in large dimensions

TL;DR

This work analyzes kernel interpolation in the high-dimensional regime where , focusing on the inner product kernel on the sphere. By exploiting the explicit spherical-harmonic eigenstructure and the RKHS interpolation spaces , it derives exact upper and lower bounds for the variance and bias of the minimum-norm interpolant and translates them into a complete -phase diagram. The variance scales as and the bias as , with and , yielding a total generalization error that underpins regions of minimax optimality, sub-optimality, and inconsistency. A sharp minimax lower bound aligns with the upper bounds, establishing the first complete phase diagram for kernel interpolation in large dimensions and clarifying when benign overfitting can occur. The results have implications for understanding kernel-based generalization in highly parameterized regimes and guide design choices in high-dimensional learning settings.

Abstract

The generalization ability of kernel interpolation in large dimensions (i.e., for some ) might be one of the most interesting problems in the recent renaissance of kernel regression, since it may help us understand the 'benign overfitting phenomenon' reported in the neural networks literature. Focusing on the inner product kernel on the sphere, we fully characterized the exact order of both the variance and bias of large-dimensional kernel interpolation under various source conditions . Consequently, we obtained the -phase diagram of large-dimensional kernel interpolation, i.e., we determined the regions in -plane where the kernel interpolation is minimax optimal, sub-optimal and inconsistent.
Paper Structure (14 sections, 19 theorems, 97 equations, 1 figure)

This paper contains 14 sections, 19 theorems, 97 equations, 1 figure.

Key Result

Theorem 3.0.1

Let $c_{1} d^{\gamma} \le n \le c_{2} d^{\gamma}$ for some fixed $\gamma >0$ and absolute constants $c_{1}, c_{2}$. Consider model Eq.main data model and let $k = k_{d}$ be the inner product kernel on the sphere satisfying Assumption assumption inner product kernel. Denote $l = \lfloor \gamma \rfloo where $\Theta_{\mathbb{P}}$ only involves constants depending on $\gamma$, $c_{1}$ and $c_{2}$.

Figures (1)

  • Figure 1: An illustration of the optimal, sub-optimal and inconsistent regions. When $(s,\gamma)$ falls in the region below the curve (including the boundary), kernel interpolation is optimal; When $(s,\gamma)$ falls in the region above the curve (not including the boundary), kernel interpolation is sub-optimal; When $s=0$ or $\gamma \in \mathbb{N}^{+}$, kernel interpolation is inconsistent.

Theorems & Definitions (30)

  • Theorem 3.0.1
  • Theorem 3.0.2
  • Corollary 3.0.3
  • Proposition 4.2.1: Theorem 5 in zhang2024optimal
  • Lemma 5.2.1
  • Lemma 5.2.2
  • Lemma 5.2.3
  • Lemma 5.2.4
  • proof
  • Lemma 5.2.5
  • ...and 20 more