A Bayesian quantification of consistency in correlated datasets

Fabian Köhlinger; Benjamin Joachimi; Marika Asgari; Massimo Viola; Shahab Joudaki; Tilman Tröster

A Bayesian quantification of consistency in correlated datasets

Fabian Köhlinger, Benjamin Joachimi, Marika Asgari, Massimo Viola, Shahab Joudaki, Tilman Tröster

TL;DR

This paper develops a three-tier Bayesian framework to quantify consistency in correlated datasets, addressing self-consistency and cross-dataset tension in cosmology. It combines (i) Bayes factors for global consistency, (ii) posterior-difference tests from duplicated parameter sets, and (iii) predictive checks in the data domain via translated posterior distributions to diagnose sources of tension. Applied to KiDS-450 cosmic shear, the approach finds no significant internal tension (<3σ) across multiple data splits and shows that much of previously claimed tension can be mitigated by accounting for correlations and updated covariance modelling. The work emphasizes that different tension metrics probe different aspects of the data-model relationship and provides a general, end-to-end methodology for assessing consistency in future large-scale structure surveys.

Abstract

We present three tiers of Bayesian consistency tests for the general case of $correlated$ datasets. Building on duplicates of the model parameters assigned to each dataset, these tests range from Bayesian evidence ratios as a global summary statistic, to posterior distributions of model parameter differences, to consistency tests in the data domain derived from posterior predictive distributions. For each test we motivate meaningful threshold criteria for the internal consistency of datasets. Without loss of generality we focus on mutually exclusive, correlated subsets of the same dataset in this work. As an application, we revisit the consistency analysis of the two-point weak lensing shear correlation functions measured from KiDS-450 data. We split this dataset according to large vs. small angular scales, tomographic redshift bin combinations, and estimator type. We do not find any evidence for significant internal tension in the KiDS-450 data, with significances below $3\, σ$ in all cases. Software and data used in this analysis can be found at http://kids.strw.leidenuniv.nl/sciencedata.php

A Bayesian quantification of consistency in correlated datasets

TL;DR

Abstract

We present three tiers of Bayesian consistency tests for the general case of

datasets. Building on duplicates of the model parameters assigned to each dataset, these tests range from Bayesian evidence ratios as a global summary statistic, to posterior distributions of model parameter differences, to consistency tests in the data domain derived from posterior predictive distributions. For each test we motivate meaningful threshold criteria for the internal consistency of datasets. Without loss of generality we focus on mutually exclusive, correlated subsets of the same dataset in this work. As an application, we revisit the consistency analysis of the two-point weak lensing shear correlation functions measured from KiDS-450 data. We split this dataset according to large vs. small angular scales, tomographic redshift bin combinations, and estimator type. We do not find any evidence for significant internal tension in the KiDS-450 data, with significances below

in all cases. Software and data used in this analysis can be found at http://kids.strw.leidenuniv.nl/sciencedata.php

A Bayesian quantification of consistency in correlated datasets

TL;DR

Abstract

A Bayesian quantification of consistency in correlated datasets

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (19)