Systematic assessment of the Hubble tension via Bayesian jackknife testing
Thomas Hughes, Michael J. Wilensky, Philip Bull
TL;DR
We address the Hubble tension by applying a Bayesian jackknife approach (CHIBORG) to 16 independent measurements of $H_0$, comparing early- vs late-Universe determinations and exploring phenomenological bias scenarios. We develop a hierarchical bias model and use PolyChord to compute evidences for competing hypotheses, yielding a model-weighted posterior for $H_0$. The results show a preference for late-Universe biases but no single scenario dominates, with a multi-modal $H_0$ posterior spanning $66.7 < H_0 < 72.7$ km/s/Mpc at 95\% credibility. The work highlights the need for improved correlation modeling and physically motivated bias mappings, and provides a public codebase to enable further exploration.
Abstract
Statistically-significant differences in the value of the Hubble parameter are found depending on the measurement method that is used, a result known as the Hubble tension. A variety of ways of comparing, grouping, and excluding measurements have been used to try to explain this, either in terms of physical effects or systematic errors. We present a systematic 'Bayesian jackknife' analysis of 16 independent measurements of the Hubble parameter in an attempt to identify whether the measurements fall into meaningful clusters that would help explain the origin of the tension. After evaluating evidence ratios for the commonly-used split into early- vs late-time measurements, we then study a range of simplified alternative physical scenarios that reflect different physical origins of an apparent bias or shift in the value of $H_0$, assigning phenomenological population parameters to each subset. These include scenarios where specific subsets are biased (e.g. due to unrecognised experimental systematics in the local distance ladder or cosmic microwave background measurements), as well as more cosmologically-motivated cases involving modifications to the expansion history. Many of these scenarios have similar marginal likelihood, but the model where no measurements are biased is strongly disfavoured. Finally, we marginalise over all these scenarios to estimate the 'model agnostic' posterior distribution of $H_0$. The resulting distribution is mildly multi-modal, but modestly favours values near $H_0=68$ km/s/Mpc, with a 95\% credible region of $66.7 < H_0 < 72.7$ km/s/Mpc.
