Kernel-, mean- and noise-marginalised Gaussian processes for exoplanet transits and $H_0$ inference
Namu Kroupa, David Yallup, Will Handley, Michael Hobson
TL;DR
The paper presents a fully Bayesian Gaussian Process regression framework that marginalises over kernel choice and hyperparameters, using transdimensional sampling and nested sampling to compute kernel evidences $Z_k$ and kernel posteriors $p_k$. This approach enables direct kernel comparison, principled marginalisation of kernel uncertainty, and robust inference for both exoplanet transit signals and cosmological $H_0$ measurements, including consideration of mean and noise models. Key findings show that kernel marginalisation recovers the true kernel in favorable (high-SNR) regimes, reduces bias in mean-function hyperparameters, and yields $H_0$ values consistent with external benchmarks within uncertainties, while avoiding the inductive bias of pre-selected kernels. The method is demonstrated on synthetic transit data and real cosmic chronometer plus BAO data, with kernel posteriors guiding interpretation and offering a path toward more reliable GP analyses in astrophysics and cosmology.
Abstract
Using a fully Bayesian approach, Gaussian Process regression is extended to include marginalisation over the kernel choice and kernel hyperparameters. In addition, Bayesian model comparison via the evidence enables direct kernel comparison. The calculation of the joint posterior was implemented with a transdimensional sampler which simultaneously samples over the discrete kernel choice and their hyperparameters by embedding these in a higher-dimensional space, from which samples are taken using nested sampling. Kernel recovery and mean function inference were explored on synthetic data from exoplanet transit light curve simulations. Subsequently, the method was extended to marginalisation over mean functions and noise models and applied to the inference of the present-day Hubble parameter, $H_0$, from real measurements of the Hubble parameter as a function of redshift, derived from the cosmologically model-independent cosmic chronometer and $Λ$CDM-dependent baryon acoustic oscillation observations. The inferred $H_0$ values from the cosmic chronometers, baryon acoustic oscillations and combined datasets are $H_0= 66 \pm 6\, \mathrm{km}\,\mathrm{s}^{-1}\,\mathrm{Mpc}^{-1}$, $H_0= 67 \pm 10\, \mathrm{km}\,\mathrm{s}^{-1}\,\mathrm{Mpc}^{-1}$ and $H_0= 69 \pm 6\, \mathrm{km}\,\mathrm{s}^{-1}\,\mathrm{Mpc}^{-1}$, respectively. The kernel posterior of the cosmic chronometers dataset prefers a non-stationary linear kernel. Finally, the datasets are shown to be not in tension with $\ln R=12.17\pm 0.02$.
