On priors and scale cuts in EFT-based full-shape analyses
Anton Chudaykin, Mikhail M. Ivanov, Takahiro Nishimichi
TL;DR
The paper investigates how priors and scale cuts influence EFT-based full-shape analyses of galaxy clustering. It identifies two main sources of bias: prior-volume projection effects, and genuine two-loop theory biases that do not vanish with better priors or larger data volumes. By contrasting the West Coast (WC) and East Coast (EC) pipelines on PT Challenge and BOSS-like data, it shows that optimistic scale cuts and inconsistent stochastic modeling in WC lead to significant biases in σ₈ (over 5%, roughly 1σ) and modest biases in ω_cdm, whereas EC3 with multi-probe data and consistent stochastic terms keeps biases well below the statistical errors. The work demonstrates that scale cuts are a major driver of bias, and that including scale-dependent stochastic counterterms, higher-order counterterms, and multiple observables (bispectrum, hexadecapole) reduces both theory and projection biases. Practically, this supports using conservative k_max, simulation-informed priors, and comprehensive data vectors for robust cosmological inference with EFT-based full-shape analyses in current and upcoming surveys such as DESI and Euclid.
Abstract
Parameter estimation from galaxy survey data with the full-shape method depends on scale cuts and on priors on EFT parameters. The effects of priors, including the so-called "prior volume" phenomenon, were originally studied in Ivanov et al. (2019) and subsequent works. In this note, we repeat and extend these tests and also apply them to other priors used in the literature. We point out that, in addition to the "prior volume" effect, there is a more dangerous effect that is largely overlooked: a systematic bias on cosmological parameters due to overoptimistic scale cuts. Unlike the "prior volume" effect, this is a genuine systematic bias due to two-loop corrections, and it does not vanish with better priors or with larger data volumes. Our study is based on the high-fidelity BOSS-like PT Challenge simulation data, which offer many advantages over analyses based on synthetic data generated with fitting pipelines. We show that some analysis choices associated with the PyBird code, especially the scale cuts, significantly bias parameter recovery, overestimating $\sigma_8$ by over $5\%$ (equivalent to $1\sigma$). The bias on measured EFT parameters is even more significant. In contrast, the analysis choices associated with the CLASS-PT code lead to much smaller ($\lesssim 1\%$) shifts in cosmological parameters based on their best-fit values.
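The distinction between a "prior volume" shift and a genuine theory bias can be illustrated with a toy model. The sketch below is not the analysis of the paper: the two-parameter model, the names `toy_fit`, `A`, `b`, `delta`, and all numerical values are assumptions chosen for illustration. An amplitude-like parameter `A` and a nuisance parameter `b` are fit to two noiseless observables, `A*b` and `A*b**2`, with flat priors on a grid. The function returns the best-fit value of `A` and its marginalized posterior mean.

```python
import numpy as np

def toy_fit(data, sigma):
    """Grid posterior for a hypothetical model (A*b, A*b**2) with flat priors.

    Returns (best-fit A, marginalized posterior mean of A).
    """
    A = np.linspace(0.2, 3.0, 561)   # flat prior range for the amplitude
    b = np.linspace(0.3, 3.0, 541)   # flat prior range for the nuisance parameter
    AA, BB = np.meshgrid(A, b, indexing="ij")
    chi2 = ((AA * BB - data[0]) ** 2 + (AA * BB**2 - data[1]) ** 2) / sigma**2
    post = np.exp(-0.5 * (chi2 - chi2.min()))        # unnormalized posterior
    i_best, _ = np.unravel_index(np.argmax(post), post.shape)
    marg_A = post.sum(axis=1)                        # marginalize over b
    mean_A = np.sum(A * marg_A) / marg_A.sum()
    return A[i_best], mean_A

A_true, b_true = 1.0, 1.0
exact = np.array([A_true * b_true, A_true * b_true**2])

# Case 1: the model is exact. Only the marginalized mean moves, and the shift
# shrinks as the errors shrink (a stand-in for a larger data volume).
best_large, mean_large = toy_fit(exact, sigma=0.1)
best_small, mean_small = toy_fit(exact, sigma=0.02)
shift_large = mean_large - A_true
shift_small = mean_small - A_true

# Case 2: the data contain a small term absent from the model (a stand-in for
# a neglected higher-order correction). The best fit itself is now displaced,
# and smaller errors do not help.
delta = 0.02
mismodeled = exact + np.array([delta, 0.0])
bias_large, _ = toy_fit(mismodeled, sigma=0.1)
bias_small, _ = toy_fit(mismodeled, sigma=0.02)

print("projection shift of <A>:", shift_large, shift_small)
print("best-fit A, exact model:", best_large, best_small)
print("best-fit A, mismodeled data:", bias_large, bias_small)
```

In this toy setup the best fit of the mismodeled case sits near $A = (1+\delta)^2$ for any error bar, since solving $Ab = 1+\delta$ and $Ab^2 = 1$ gives $b = 1/(1+\delta)$. The projection shift of the marginalized mean, in contrast, scales with the size of the errors and fades as the data become more constraining.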
