Scalable mixed-domain Gaussian process modeling and model reduction for longitudinal data
Juho Timonen, Harri Lähdesmäki
TL;DR
This work tackles the computational bottlenecks of Gaussian process modeling with mixed continuous and categorical inputs in longitudinal data. It introduces a basis-function, Hilbert-space approximation that yields $O(NM)$ complexity and supports arbitrary observation models, enabling full Bayesian inference for additively decomposed mixed-domain kernels. The authors develop a practical model-reduction workflow combining additive variance decomposition and projection predictive selection, demonstrating effective reduction of model size while preserving predictive performance. Through simulations and real-data experiments (e.g., Canadian weather and US elections), the method achieves near-exact accuracy at a fraction of the runtime and provides interpretable, category-specific and shared effects. An open-source R package lgpr2 accompanies the approach, making scalable, interpretable mixed-domain GP modeling accessible for large longitudinal datasets.
Abstract
Gaussian process (GP) models that combine both categorical and continuous input variables have found use in analysis of longitudinal data and computer experiments. However, standard inference for these models has the typical cubic scaling, and common scalable approximation schemes for GPs cannot be applied since the covariance function is non-continuous. In this work, we derive a basis function approximation scheme for mixed-domain covariance functions, which scales linearly with respect to the number of observations and total number of basis functions. The proposed approach is naturally applicable to also Bayesian GP regression with discrete observation models. We demonstrate the scalability of the approach and compare model reduction techniques for additive GP models in a longitudinal data context. We confirm that we can approximate the exact GP model accurately in a fraction of the runtime compared to fitting the corresponding exact model. In addition, we demonstrate a scalable model reduction workflow for obtaining smaller and more interpretable models when dealing with a large number of candidate predictors.
