Data-driven informative priors for Bayesian inference with quasi-periodic data
Javier Lopez-Santiago, Luca Martino, Joaquin Miguez, Gonzalo Vazquez-Vilar
TL;DR
The paper tackles Bayesian inference for quasi-periodic data, where the posterior of the period parameter is highly concentrated. It builds a data-driven prior for the period by fitting a Gaussian process (GP) with a periodic kernel and approximating the posterior of the GP hyperparameters with adaptive importance sampling. The marginal posterior of the period hyperparameter is then used as the prior for the period in the parametric model, in a modular, empirical Bayes framework. The method is demonstrated on synthetic sine data and real astrophysical time series (radial velocity and light curves), showing tighter, more informative posteriors for the period and improved inference relative to uninformative priors and purely frequency-based methods. This approach reduces search space dimensionality and enhances robustness in exoplanet and binary-star analyses, with broad applicability to quasi-periodic time series.
Abstract
Bayesian computational strategies for inference can be inefficient at approximating the posterior distribution in models that exhibit some form of periodicity. This is because the probability mass of the marginal posterior distribution of the period parameter is usually concentrated in a very small region of the parameter space. It is therefore necessary to provide the inference method with as much information as possible through the prior distribution of the parameters. We aim to show that such a prior can be constructed from the data by fitting a Gaussian process (GP) with a periodic kernel. More specifically, we aim to show that it is possible to approximate the marginal posterior distribution of the kernel hyperparameter corresponding to the period, and that this distribution can then be used as the prior distribution for the period in the inference method. We use an adaptive importance sampling method to approximate the posterior distribution of the GP hyperparameters, and we use the marginal posterior of the period hyperparameter to construct the prior distribution for the period of the parametric model. This workflow is empirical Bayes, implemented as a modular (cut) transfer of the GP posterior for the period to the parametric model: the posterior is passed forward as a prior, with no feedback from the parametric model to the GP. We apply the proposed methodology to both synthetic and real data and analyze the impact of the data-driven prior on the marginal posterior distribution of the period in the parametric model.
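The two-stage workflow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it replaces adaptive importance sampling with plain importance sampling from a uniform proposal, keeps every GP hyperparameter except the period fixed, and summarizes the resulting weighted sample as a Gaussian. The function names, the fixed amplitude, length scale and noise level, the period range, and the toy sine model with unit amplitude and zero phase are all assumptions made for illustration.

```python
import numpy as np


def periodic_kernel(t1, t2, period, amplitude=1.0, length_scale=1.0):
    """Exp-sine-squared (periodic) covariance between two sets of times."""
    dt = t1[:, None] - t2[None, :]
    return amplitude**2 * np.exp(
        -2.0 * np.sin(np.pi * dt / period) ** 2 / length_scale**2
    )


def gp_log_marginal_likelihood(t, y, period, noise_std=0.3):
    """Log marginal likelihood of a zero-mean GP with the periodic kernel."""
    K = periodic_kernel(t, t, period) + noise_std**2 * np.eye(len(t))
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    return (
        -0.5 * y @ alpha
        - np.sum(np.log(np.diag(L)))
        - 0.5 * len(t) * np.log(2.0 * np.pi)
    )


def period_prior_from_gp(t, y, p_min, p_max, n_samples=2000, seed=0):
    """Stage 1: approximate the GP posterior of the period hyperparameter.

    Assumption: plain (non-adaptive) importance sampling with a uniform
    proposal equal to a uniform hyper-prior, so the weights are proportional
    to the GP marginal likelihood.
    """
    rng = np.random.default_rng(seed)
    periods = rng.uniform(p_min, p_max, n_samples)
    log_w = np.array([gp_log_marginal_likelihood(t, y, p) for p in periods])
    w = np.exp(log_w - log_w.max())  # subtract the max for numerical stability
    w /= w.sum()
    mean = np.sum(w * periods)
    std = np.sqrt(np.sum(w * (periods - mean) ** 2))
    return mean, std, periods, w


def sine_log_posterior(period, t, y, prior_mean, prior_std, noise_std=0.3):
    """Stage 2: unnormalized log-posterior of the period in the parametric model.

    Assumption: toy model y = sin(2*pi*t/period) + Gaussian noise, with the
    GP-derived Gaussian used as the prior. Nothing is fed back to the GP stage.
    """
    resid = y - np.sin(2.0 * np.pi * t / period)
    log_lik = -0.5 * np.sum(resid**2) / noise_std**2
    log_prior = -0.5 * ((period - prior_mean) / prior_std) ** 2
    return log_lik + log_prior


# Synthetic sine data with a true period of 3 (hypothetical example values).
rng = np.random.default_rng(1)
t = np.sort(rng.uniform(0.0, 20.0, 40))
y = np.sin(2.0 * np.pi * t / 3.0) + 0.3 * rng.normal(size=t.size)

prior_mean, prior_std, _, _ = period_prior_from_gp(t, y, p_min=2.0, p_max=4.0)
print(f"GP-derived prior for the period: N({prior_mean:.3f}, {prior_std:.3f}^2)")
```

In this sketch the second stage only evaluates the parametric log-posterior. In practice it would be passed to whichever sampler is used for the parametric model, with the GP-derived prior concentrating the search around the plausible periods.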
