Inferring Cosmological Parameters with Evidential Physics-Informed Neural Networks
Hai Siong Tan
TL;DR
The paper develops an evidential physics-informed neural network (E-PINN) framework, augmented with Gaussian Process supervision, to infer cosmological parameters from Pantheon+ SN Ia and BAO data while yielding calibrated posterior uncertainties. By modeling the luminosity-distance relation as a surrogate with a t-distribution likelihood and enforcing a PDE constraint through the Hubble function H(z;Ω), the approach yields posterior distributions for Ω and H0 under ΛCDM, wCDM, and ΛsCDM. Empirical coverage and model-evidence analyses reveal dataset-dependent tensions between Pantheon+ and BAO when trained separately, while combining data reduces discrepancies and yields more coherent posteriors, albeit with larger uncertainties than traditional χ2-based analyses. The work introduces data-informed priors and GP-guided uncertainty learning, offering a flexible, uncertainty-calibrated alternative for cosmological parameter inference. It also points to future improvements, such as model-independent r_d treatments and incorporating data correlations, to further refine the Hubble tension landscape.
Abstract
We examine the use of a novel variant of Physics-Informed Neural Networks to predict cosmological parameters from recent supernovae and baryon acoustic oscillations (BAO) datasets. Our machine learning framework generates uncertainty estimates for target variables and the inferred unknown parameters of the underlying PDE descriptions. Built upon a hybrid of the principles of Evidential Deep Learning, Physics-Informed Neural Networks, Bayesian Neural Networks and Gaussian Processes, our model enables learning of the posterior distribution of the unknown PDE parameters through standard gradient-descent based training. We apply our model to an up-to-date BAO dataset (Bousis et al. 2024) calibrated with the CMB-inferred sound horizon, and the Pantheon$+$ Sne Ia distances (Scolnic et al. 2018), examining the relative effectiveness and mutual consistency among the standard $Λ$CDM, $w$CDM and $Λ_s$CDM models. Unlike previous results arising from the standard approach of minimizing an appropriate $χ^2$ function, the posterior distributions for parameters in various models trained purely on Pantheon$+$ data were found to be largely contained within the $2σ$ contours of their counterparts trained on BAO data. Their posterior medians for $h_0$ were within about $2σ$ of one another, indicating that our machine learning-guided approach provides a different measure of the Hubble tension.
