Benchmarking AI-evolved cosmological structure formation
Xiaofeng Dong, Nesar Ramachandra, Salman Habib, Katrin Heitmann
TL;DR
This work assesses the feasibility of using a 3-D U-Net to emulate cosmological structure formation by learning a forward map from initial Lagrangian displacement fields to evolved fields, trained on ZA and PM datasets. It defines a comprehensive, physics-motivated benchmark suite—including density PDFs, power spectra, bispectra, percolation, and cross-power tests—to quantify fidelity beyond traditional statistics. A standard mean-squared error loss underperforms on small scales, while a density-weighted loss significantly improves accuracy for dense, nonlinear structures and key clustering statistics, illustrating how physics-informed objectives can boost scientific emulation. The study shows potential for fast covariance generation and parameter inference, while also outlining challenges in scalability, boundary effects, and the need for integrating physics constraints and multi-resolution approaches in future AI-based cosmological emulators.
Abstract
The potential of deep learning-based image-to-image translations has recently attracted significant attention. One possible application of such a framework is as a fast, approximate alternative to cosmological simulations, which would be particularly useful in various contexts, including covariance studies, investigations of systematics, and cosmological parameter inference. To investigate different aspects of learning-based cosmological mappings, we choose two approaches for generating suitable cosmological matter fields as datasets: a simple analytical prescription provided by the Zel'dovich approximation, and a numerical N-body method using the Particle-Mesh approach. The evolution of structure formation is modeled using U-Net, a widely employed convolutional image translation framework. Because of the lack of a controlled methodology, validation of these learned mappings requires multiple benchmarks beyond simple visual comparisons and summary statistics. A comprehensive list of metrics is considered, including higher-order correlation functions, conservation laws, topological indicators, and statistical independence of density fields. We find that the U-Net approach performs well only for some of these physical metrics, and accuracy is worse at increasingly smaller scales, where the dynamic range in density is large. By introducing a custom density-weighted loss function during training, we demonstrate a significant improvement in the U-Net results at smaller scales. This study provides an example of how a family of physically motivated benchmarks can, in turn, be used to fine-tune optimization schemes -- such as the density-weighted loss used here -- to significantly enhance the accuracy of scientific machine learning approaches by focusing attention on relevant features.
