An Empirical Analysis of the Laplace and Neural Tangent Kernels
Ronaldas Paulius Lencevičius
TL;DR
This work empirically analyzes the practical equivalence between the Laplace kernel and the neural tangent kernel (NTK) in Gaussian process regression, emphasizing the unit-sphere domain $\mathbb{S}^{d-1}$ where their RKHS coincide. It demonstrates that exact kernel matching and posterior matching depend on jointly tuning the NTK depth $D$ and bias $\beta$ against the Laplace length-scale $\ell$, with deeper networks requiring smaller $\beta$ and $\ell$ for alignment. The study finds strong posterior-mean equivalence on $\mathbb{S}^{d-1}$ but limited overlap in $\mathbb{R}^d$, and shows the Gaussian kernel cannot fully replicate NTK/Laplace behavior. It also provides practical tooling (scikit-ntk) and a framework for comparing kernel-based regression methods via GP posteriors and RKHS analysis, highlighting both theoretical and computational implications for kernel design and neural-network-informed kernel methods.
Abstract
The neural tangent kernel is a kernel function defined over the parameter distribution of an infinite width neural network. Despite the impracticality of this limit, the neural tangent kernel has allowed for a more direct study of neural networks and a gaze through the veil of their black box. More recently, it has been shown theoretically that the Laplace kernel and neural tangent kernel share the same reproducing kernel Hilbert space in the space of $\mathbb{S}^{d-1}$ alluding to their equivalence. In this work, we analyze the practical equivalence of the two kernels. We first do so by matching the kernels exactly and then by matching posteriors of a Gaussian process. Moreover, we analyze the kernels in $\mathbb{R}^d$ and experiment with them in the task of regression.
