Fast Sampling of Cosmological Initial Conditions with Gaussian Neural Posterior Estimation
Oleg Savchenko, Guillermo Franco Abellán, Florian List, Noemi Anau Montel, Christoph Weniger
TL;DR
The paper addresses the challenge of reconstructing cosmological initial conditions (ICs) from late-time large-scale structure data, a high-dimensional, ill-posed inverse problem. It introduces Gaussian Neural Posterior Estimation (NPE) within simulation-based inference (SBI), modeling the IC posterior as Gaussian in the initial field with a diagonal Fourier-space precision, which enables fast, amortised sampling even with full $N$-body forward models. A 3D U-Net-based MAP estimator coupled with a learnable diagonal likelihood precision in Fourier space yields an efficiently trainable model; an analytic Gaussian fit to the precision as a function of wavenumber further enables turning any IC point estimator into a fast sampler. Validation on Quijote simulations shows power spectrum accuracy at the ~1–2% level up to Nyquist and strong cross-correlations, with per-$k$-bin Bayesian coverage indicating well-calibrated uncertainties. The approach is differentiable-free and scalable, offering a practical route to sequential SBI and integration of observational effects in future work.
Abstract
Knowledge of the primordial matter density field from which the large-scale structure of the Universe emerged over cosmic time is of fundamental importance for cosmology. However, reconstructing these cosmological initial conditions from late-time observations is a notoriously difficult task, which requires advanced cosmological simulators and sophisticated statistical methods to explore a multi-million-dimensional parameter space. We show how simulation-based inference (SBI) can be used to tackle this problem and to obtain data-constrained realisations of the primordial dark matter density field in a simulation-efficient way with general non-differentiable simulators. Our method is applicable to full high-resolution dark matter $N$-body simulations and is based on modelling the posterior distribution of the constrained initial conditions to be Gaussian with a diagonal covariance matrix in Fourier space. As a result, we can generate thousands of posterior samples within seconds on a single GPU, orders of magnitude faster than existing methods, paving the way for sequential SBI for cosmological fields. Furthermore, we perform an analytical fit of the estimated dependence of the covariance on the wavenumber, effectively transforming any point-estimator of initial conditions into a fast sampler. We test the validity of our obtained samples by comparing them to the true values with summary statistics and performing a Bayesian consistency test.
