Bayesian Kernel Regression for Functional Data
Minoru Kusaba, Megumi Iwayama, Ryo Yoshida
TL;DR
This work addresses functional output regression by introducing KRFD, a kernel-based model that predicts functions $Y(X,t)$ from covariates $X$ while leveraging the covariance structure across the output domain via a separable kernel RKHS. The method expresses $Y(X,t)$ as a kernel expansion in $t$ with $X$-dependent weights, leading to a linear-in-parameters form that admits analytic Bayesian estimation and predictive uncertainty through Gaussian posteriors in the RKHS. A sparse-data extension (KRSFD) broadens applicability to irregularly sampled functions, and a representer-theorem perspective links KRFD to multitask learning with separable kernels, while preserving aBayesian treatment of uncertainty not common in classic MTL. Empirical results on dense artificial data, sparse artificial data, and density-of-states predictions for materials demonstrate that KRFD consistently outperforms FLM and sometimes KRR, with KRFD uniquely providing a predictive distribution for new inputs. The work highlights practical pathways for scalable kernel-based functional regression and underscores the method’s potential for materials science and other domains requiring uncertainty-aware functional predictions.
Abstract
In supervised learning, the output variable to be predicted is often represented as a function, such as a spectrum or probability distribution. Despite its importance, functional output regression remains relatively unexplored. In this study, we propose a novel functional output regression model based on kernel methods. Unlike conventional approaches that independently train regressors with scalar outputs for each measurement point of the output function, our method leverages the covariance structure within the function values, akin to multitask learning, leading to enhanced learning efficiency and improved prediction accuracy. Compared with existing nonlinear function-on-scalar models in statistical functional data analysis, our model effectively handles high-dimensional nonlinearity while maintaining a simple model structure. Furthermore, the fully kernel-based formulation allows the model to be expressed within the framework of reproducing kernel Hilbert space (RKHS), providing an analytic form for parameter estimation and a solid foundation for further theoretical analysis. The proposed model delivers a functional output predictive distribution derived analytically from a Bayesian perspective, enabling the quantification of uncertainty in the predicted function. We demonstrate the model's enhanced prediction performance through experiments on artificial datasets and density of states prediction tasks in materials science.
