Statistical isotropy of the universe and the look-elsewhere effect
Alan H. Guth, Mohammad Hossein Namjoo
TL;DR
The paper critiques a recent claim of strong CMB statistical anisotropy by Jones et al., arguing that two of the four tested anomalies do not probe anisotropy and that look-elsewhere effects significantly dilute the reported significance. It derives the distribution of the product of the four smallest $p$-values among $n_T$ tests, showing that the observed joint $p$-value can arise with a moderate number of tests and is not as extreme as claimed. By estimating test correlations, the authors deduce that 16–50 independent tests could plausibly reduce the joint significance to $3\sigma$ or $2\sigma$, respectively, casting doubt on the anisotropy claim. They compile a list of anomaly tests and propose generalizations, concluding that current data are consistent with $\Lambda$CDM and statistical isotropy, with publication bias and parameter choices likely increasing the number of performed tests.
Abstract
Recently, Jones et al. [arXiv:2310.12859] claimed strong evidence for the statistical anisotropy of the universe. The claim is based on a joint analysis of four different anomaly tests of the cosmic microwave background data, each of which is known to be anomalous, with a lower level of significance. They reported a combined $p$-value of about $3\times 10^{-8}$, which is more than a $5σ$ level of significance. We observe that statistical anisotropy is not even relevant for two of the four considered tests, which seems sufficient to invalidate the authors' claim. Furthermore, even if one reinterprets the claim as evidence against $Λ$CDM rather than statistical anisotropy, we argue that this result significantly suffers from the look-elsewhere effect. Assuming a set of independent (i.e., uncorrelated) tests, we show that if the four tests with the smallest $p$-values are cherry-picked from 10 independent tests, the $p$-value reported by Jones et al. corresponds to only $3σ$ significance. If there are 27 independent tests, the significance falls to $2σ$. These numbers, however, overstate our argument, since the four tests used by Jones et al. are slightly correlated. Determining the correlation of Jones et al.'s tests by comparing their joint $p$-value with the product of the four separate $p$-values, we find that about 16 or 50 tests are sufficient to reduce the significance of Jones et al.'s results to 3$σ$ or 2$σ$ significance, respectively. We also provide a list of anomaly tests discussed in the literature (and propose a few generalizations), suggesting that very plausibly 16 (or even 50) independent tests have been published, and possibly many more have been considered but not published. We conclude that the current data is consistent with the $Λ$CDM model and, in particular, with statistical isotropy.
