Fast Evaluation of Additive Kernels: Feature Arrangement, Fourier Methods, and Kernel Derivatives
Theresa Wagner, Franziska Nestler, Martin Stoll
TL;DR
This work tackles the computational bottleneck of dense kernel matrices by combining feature-space partitioning with non-equispaced fast Fourier transforms (NFFT) to accelerate additive Gaussian kernels and their derivatives. It introduces a flexible framework for arranging features into low-dimensional windows, backed by a spectrum of selection, regularization, clustering, and optimization strategies, with an explicit NFFT-based method for both kernel evaluations and derivative computations. The authors derive rigorous Fourier error bounds for the Gaussian and derivative kernels, and develop a global-sensitivity-analysis approach in the Fourier domain to guide window construction. Numerical experiments on large-scale benchmarks show that additive-kernel ridge regression with NFFT acceleration can outperform full-kernel baselines in both accuracy (RMSE) and efficiency, and extendable to Matérn-type kernels. The methods promise scalable, interpretable additive models suitable for GP/SVM contexts, with practical impact in large data regimes and hyperparameter optimization.
Abstract
One of the main computational bottlenecks when working with kernel based learning is dealing with the large and typically dense kernel matrix. Techniques dealing with fast approximations of the matrix vector product for these kernel matrices typically deteriorate in their performance if the feature vectors reside in higher-dimensional feature spaces. We here present a technique based on the non-equispaced fast Fourier transform (NFFT) with rigorous error analysis. We show that this approach is also well suited to allow the approximation of the matrix that arises when the kernel is differentiated with respect to the kernel hyperparameters; a problem often found in the training phase of methods such as Gaussian processes. We also provide an error analysis for this case. We illustrate the performance of the additive kernel scheme with fast matrix vector products on a number of data sets. Our code is available at https://github.com/wagnertheresa/NFFTAddKer
