On frequentist confidence intervals in a non-Gaussian regime
Shubham Barua, Shantanu Desai, Mauricio Lopez-Hernandez, Eoin Ó Colgáin
TL;DR
The paper assesses how different frequentist confidence-interval constructions perform when MCMC posteriors are non-Gaussian in a two-parameter $\Lambda$CDM framework using cosmic chronometer data and DESI mocks. It compares graphical profile likelihoods (Wilks and integration) with the Feldman-Cousins (FC) Neyman belt, all driven by the same MCMC chain to ensure a fair baseline. In near-Gaussian regimes the methods align within about 10%, but non-Gaussianity and parameter boundaries lead to noticeable discrepancies, with FC providing robust coverage and sometimes tighter intervals. A key finding is a ~2$\sigma$ shift in $\Omega_m$ between low- and high-redshift CC subsets when mocks are fitted to all parameters, while fixing parameters artificially tightens intervals and can mask non-Gaussian features. The results emphasize cautious use of Gaussian approximations in cosmology, advocate FC/Neyman approaches in non-Gaussian settings, and highlight that the number of fitted parameters strongly shapes interval widths and inferred tensions.
Abstract
We study frequentist confidence intervals based on graphical profile likelihoods (Wilks' theorem, likelihood integration), and the Feldman-Cousins (FC) prescription, a generalisation of the Neyman belt construction, in a setting with non-Gaussian Markov chain Monte Carlo (MCMC) posteriors. Our simplified setting allows us to recycle the MCMC chain as an input in all methods, including mock simulations underlying the FC approach. We find all methods agree to within $10 \%$ in the close to Gaussian regime, but extending methods beyond their regime of validity leads to greater discrepancies. Importantly, we recover a $\sim 2 σ$ shift in cosmological parameters between low and high redshift cosmic chronometer data with the FC method, but only when one fits all parameters back to the mocks. We observe that fixing parameters, a common approach in the literature, risks underestimating confidence intervals.
