Waging a Campaign: Results from an Injection-Recovery Study involving 35 numerical Relativity Simulations and three Waveform Models

Sarp Akçay; Charlie Hoy; Jake Mac Uilliam

TL;DR

This paper assesses the accuracy of three state-of-the-art precessing gravitational-wave waveform models (SEOBNRv5PHM, IMRPhenomTPHM, IMRPhenomXPHM) through an extensive injection-recovery campaign using 35 strongly precessing numerical relativity (NR) binary black hole (BBH) simulations, analyzed with a two-detector network at O4-design sensitivity. It quantifies biases via recovery scores and inspiral-merger-ringdown (IMR) consistency tests, revealing that SEOBNRv5PHM generally provides the most reliable recovery of mass and mass ratio for $Q\le 4$, while IMRPhenomTPHM shows strong robustness in IMR consistency, and IMRPhenomXPHM exhibits model-dependent biases, especially at high mass ratios. The IMR consistency results depend on the chosen cutoff frequency, with Kerr ISCO cutoffs often reducing apparent GR deviations compared to Schwarzschild ISCO cutoffs; no single model consistently passes the IMR consistency test (IMRCT) for $Q=8$ injections. The study also demonstrates that incorporating model accuracy into Bayesian inference (NR-informed model averaging) yields more accurate, less biased inferences than equal-weight or evidence-based model combinations, guiding future multi-model parameter-estimation (PE) strategies. Overall, the work informs waveform-model development, motivates multi-model analyses, and provides datasets (NR injections) to benchmark next-generation precessing waveform models.
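To make the two cutoff choices concrete, the following is a minimal sketch of how a Schwarzschild and a Kerr ISCO gravitational-wave frequency could be computed. It assumes the dominant (2,2) mode frequency is twice the orbital frequency and uses the standard Bardeen-Press-Teukolsky expression for the prograde Kerr ISCO radius. The function names and the example mass and spin are hypothetical; which mass and spin the paper feeds into its cutoff is not stated in this summary.

```python
import math

# G * M_sun / c^3 in seconds (converts solar masses to a time scale).
G_MSUN_OVER_C3 = 4.925491e-6


def kerr_isco_radius(spin):
    """Prograde ISCO radius in units of GM/c^2, for 0 <= spin < 1.

    Standard Bardeen-Press-Teukolsky formula; spin = 0 gives the
    Schwarzschild value of 6.
    """
    z1 = 1 + (1 - spin**2) ** (1 / 3) * (
        (1 + spin) ** (1 / 3) + (1 - spin) ** (1 / 3)
    )
    z2 = math.sqrt(3 * spin**2 + z1**2)
    return 3 + z2 - math.sqrt((3 - z1) * (3 + z1 + 2 * z2))


def isco_gw_frequency(mass_msun, spin=0.0):
    """Dominant-mode GW frequency (Hz) at the ISCO of a black hole.

    mass_msun and spin are illustrative inputs (assumption: some estimate
    of the total or remnant mass and dimensionless spin).
    """
    radius = kerr_isco_radius(spin)
    # Orbital angular frequency in units of c^3 / (G M).
    orbital_omega = 1.0 / (radius**1.5 + spin)
    # f_GW = 2 * f_orb = Omega / pi, converted to Hz.
    return orbital_omega / (math.pi * mass_msun * G_MSUN_OVER_C3)


if __name__ == "__main__":
    mass = 100.0  # hypothetical mass in solar masses
    print("Schwarzschild ISCO cutoff [Hz]:", isco_gw_frequency(mass))
    print("Kerr ISCO cutoff, spin 0.7 [Hz]:", isco_gw_frequency(mass, 0.7))
```

For a spinning black hole the prograde ISCO lies closer to the horizon, so the Kerr cutoff sits at a higher frequency than the Schwarzschild one for the same mass. This moves the boundary between the "inspiral" and "ringdown" portions of the signal.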

Abstract

We present Bayesian inference results from an extensive injection-recovery campaign to test the validity of three state-of-the-art quasicircular gravitational waveform models: \textsc{SEOBNRv5PHM}, \textsc{IMRPhenomTPHM}, and \textsc{IMRPhenomXPHM}, the latter with the \textsc{SpinTaylorT4} implementation for its precession dynamics. We analyze 35 strongly precessing binary black hole numerical relativity simulations with all available harmonic content. Ten simulations have a mass ratio of $4:1$ and five have a mass ratio of $8:1$. Overall, we find that \textsc{SEOBNRv5PHM} is the model most consistent with numerical relativity, with the majority of true source properties lying within the inferred 90\% credible interval. However, we find that none of the models can reliably infer the true source properties for binaries with mass ratio $8:1$. We additionally conduct inspiral-merger-ringdown (IMR) consistency tests to determine whether our chosen state-of-the-art waveform models infer consistent properties when analyzing only the inspiral (low-frequency) and ringdown (high-frequency) portions of the signal. For the simulations considered in this work, we find that the outcome of the IMR consistency test depends on the frequency that separates the inspiral and ringdown regimes. For two sensible choices of the cutoff frequency, we report that \textsc{IMRPhenomXPHM} can produce false GR deviations. Meanwhile, we find that \textsc{IMRPhenomTPHM} is the most reliable model under the IMR consistency test. Finally, we re-analyze the same 35 simulations, but this time we incorporate model accuracy into our Bayesian inference. Consistent with the work in Hoy et al. 2024 [arXiv: 2409.19404], we find that this approach generally yields more accurate inferred properties for binary black holes, with fewer biases, than methods that combine model-dependent posterior distributions based on their evidence or with equal weight.
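As a rough illustration of two ideas in the abstract, the sketch below (i) mixes per-model posterior samples according to a set of weights and (ii) checks whether an injected value falls inside the symmetric 90\% credible interval. It is a simple post-hoc mixture under stated assumptions, not the authors' pipeline or the sampling scheme of Hoy et al. 2024. The model labels, toy Gaussian "posteriors", weights, and function names are all hypothetical.

```python
import random


def combine_posteriors(samples_by_model, weights, n_draws, seed=0):
    """Draw n_draws samples from a weighted mixture of per-model posteriors.

    Equal weights mimic an equal-weight combination; weights proportional
    to evidences mimic an evidence-weighted one (assumption: weights are
    supplied by the caller and need not be normalized).
    """
    rng = random.Random(seed)
    names = list(samples_by_model)
    total = sum(weights[name] for name in names)
    probs = [weights[name] / total for name in names]
    combined = []
    for _ in range(n_draws):
        name = rng.choices(names, weights=probs, k=1)[0]
        combined.append(rng.choice(samples_by_model[name]))
    return combined


def credible_interval(samples, level=0.9):
    """Symmetric credible interval from sorted samples (simple quantiles)."""
    ordered = sorted(samples)
    n = len(ordered)
    low = ordered[int((1 - level) / 2 * (n - 1))]
    high = ordered[int((1 + level) / 2 * (n - 1))]
    return low, high


def recovers_truth(samples, truth, level=0.9):
    """True if the injected value lies inside the credible interval."""
    low, high = credible_interval(samples, level)
    return low <= truth <= high


# Toy data: hypothetical mass-ratio posteriors from two models.
_rng = random.Random(42)
toy_posteriors = {
    "model_a": [_rng.gauss(4.0, 0.2) for _ in range(2000)],  # unbiased
    "model_b": [_rng.gauss(5.0, 0.2) for _ in range(2000)],  # biased
}
injected_q = 4.0

if __name__ == "__main__":
    for name, samples in toy_posteriors.items():
        print(name, "recovers truth:", recovers_truth(samples, injected_q))
    mixed = combine_posteriors(
        toy_posteriors, {"model_a": 0.5, "model_b": 0.5}, n_draws=2000
    )
    print("equal-weight 90% interval:", credible_interval(mixed))
```

In this toy setup, an equal-weight mixture widens the interval because it gives the biased model the same say as the unbiased one. Weighting toward the more accurate model is the intuition behind using accuracy information, although the approach described in the abstract builds that information into the inference itself rather than into a post-hoc mixture.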
