Costless correction of chain based nested sampling parameter estimation in gravitational wave data and beyond
Metha Prathaban, Will Handley
TL;DR
Nested sampling parameter estimation suffers an additional stochastic uncertainty from parameter variation along iso-likelihood contours, not captured by standard evidence-based error bars. The authors introduce two phantom-point–based approaches—likelihood binning and reconstructed phantom runs—to quantify and validate this uncertainty using extra likelihood evaluations produced during runtime, demonstrated on simulated gravitational-wave BBH signals. They show that these methods yield error bars and p–p plot coverages comparable to or improving upon the Higson bootstrapping method, though some parameters (e.g., luminosity distance) may require longer chains for convergence. The techniques provide single-run verification of error bars, are broadly applicable to any chain-based nested sampler, and have important implications for credible intervals and coverage in gravitational-wave analyses and beyond.
Abstract
Nested sampling parameter estimation differs from evidence estimation, in that it incurs an additional source of uncertainty. This uncertainty affects estimates of parameter means and credible intervals in gravitational wave analyses and beyond, and yet, it is typically not accounted for in standard uncertainty estimation methods. In this paper, we present two novel methods to quantify this uncertainty more accurately for any chain based nested sampler, using the additional likelihood calls made at runtime in producing independent samples. Using injected signals of black hole binary coalescences as an example, we first show concretely that the usual uncertainty estimation method is insufficient to capture the true error bar on parameter estimates. We then demonstrate how the extra points in the chains of chain based samplers may be carefully utilised to estimate this uncertainty correctly, and provide a way to check the accuracy of the resulting error bars. Finally, we discuss how this uncertainty affects $p$-$p$ plots and coverage assessments.
