Learning the Universe: Learning to Optimize Cosmic Initial Conditions with Non-Differentiable Structure Formation Models
Ludvig Doeser, Metin Ata, Jens Jasche
TL;DR
This work tackles reconstructing cosmological initial conditions from non-linear, non-differentiable forward models by introducing LULO, a gradient-free optimizer that learns to update initial-condition fields from data discrepancies while keeping full physics simulations in the loop. The neural optimizer, trained on high-dimensional $128^3$ volumetric data, provides update directions without differentiating through the simulator and uses a line search to adjust step sizes, enabling efficient navigation of non-differentiable landscapes. Empirically, LULO recovers initial conditions from $M_{200\mathrm{c}}$ halo catalogs with cross-correlations $\geq80\%$ down to non-linear scales ($k\sim1\,h\mathrm{Mpc}^{-1}$), and accurately reproduces power spectra, bispectra, halo mass functions, and peculiar velocities. This gradient-free, physics-in-the-loop approach offers a scalable path to non-linear field-level inference, compatible with arbitrary forward models and future enhancements such as higher-resolution simulations and additional cosmological probes.
Abstract
Making the most of next-generation galaxy clustering surveys requires overcoming challenges in complex, non-linear modelling to access the significant amount of information at smaller cosmological scales. Field-level inference has provided a unique opportunity beyond summary statistics to use all of the information of the galaxy distribution. However, addressing current challenges often necessitates numerical modelling that incorporates non-differentiable components, hindering the use of efficient gradient-based inference methods. In this paper, we introduce Learning the Universe by Learning to Optimize (LULO), a gradient-free framework for reconstructing the 3D cosmic initial conditions. Our approach advances deep learning to train an optimization algorithm capable of fitting state-of-the-art non-differentiable simulators to data at the field level. Importantly, the neural optimizer solely acts as a search engine in an iterative scheme, always maintaining full physics simulations in the loop, ensuring scalability and reliability. We demonstrate the method by accurately reconstructing initial conditions from $M_{200\mathrm{c}}$ halos identified in a dark matter-only $N$-body simulation with a spherical overdensity algorithm. The derived dark matter and halo overdensity fields exhibit $\geq80\%$ cross-correlation with the ground truth into the non-linear regime $k \sim 1h$ Mpc$^{-1}$. Additional cosmological tests reveal accurate recovery of the power spectra, bispectra, halo mass function, and velocities. With this work, we demonstrate a promising path forward to non-linear field-level inference surpassing the requirement of a differentiable physics model.
