Design and optimization of neural networks for multifidelity cosmological emulation
Yanhui Yang, Simeon Bird, Ming-Feng Ho, Mahdi Qezlou
TL;DR
The paper tackles the computational bottleneck of cosmological emulation on nonlinear scales by replacing Gaussian-process surrogates with a neural-network–based multifidelity framework, T2N-MusE. It introduces a 2-step multifidelity architecture, a 2-stage hyperparameter optimization, a 2-phase training strategy, and per-redshift PCA to handle high-dimensional outputs, demonstrating substantial accuracy gains on the Goku simulation suite. The results show mean LOOCV error reductions by over a factor of 5 and worst-case reductions by ~8× compared with prior GP-based approaches, culminating in the production of GokuNEmu, a highly capable matter power spectrum emulator. The approach provides scalable, efficient training for large, high-dimensional cosmological parameter spaces and can be extended to other summary statistics with publicly available code.
Abstract
Accurate and efficient simulation-based emulators are essential for interpreting cosmological survey data down to nonlinear scales. Multifidelity emulation techniques reduce simulation costs by combining high- and low-fidelity data, but traditional regression methods such as Gaussian processes struggle with scalability in sample size and dimensionality. In this work, we present T2N-MusE, a neural network framework characterized by (i) a novel 2-step multifidelity architecture, (ii) a 2-stage Bayesian hyperparameter optimization, (iii) a 2-phase $k$-fold training strategy, and (iv) a per-$z$ principal component analysis strategy. We apply T2N-MusE to selected data from the Goku simulation suite, covering a 10-dimensional cosmological parameter space, and build emulators for the matter power spectrum over a range of redshifts with different configurations. We find the emulators outperform our earlier Gaussian process models significantly and demonstrate that each of these techniques is efficient in training neural networks or/and effective in improving generalization accuracy. We observe a reduction in the mean error by more than a factor of five and in the worst-case error by approximately a factor of eight in leave-one-out cross-validation, relative to previous work. This framework has been used to build the most powerful emulator for the matter power spectrum, GokuNEmu, and will also be used to construct emulators for other statistics in future.
