Accelerating Reionization Constraints: An ANN-Emulator Framework for the SCRIPT Semi-numerical Model
Saptarshi Sarkar, Tirthankar Roy Choudhury
TL;DR
Constraining the Epoch of Reionization with high-fidelity simulations is computationally expensive, limiting exploration of multi-parameter models. The paper presents an ANN-based emulator framework that combines a coarse-resolution MCMC to locate the high-likelihood region with adaptive, targeted sampling to train an emulator for the SCRIPT semi-numerical reionization model. The resulting emulators achieve $R^2$ around $0.97$–$0.99$ using roughly $10^3$ high-resolution simulations and reproduce full high-resolution posteriors within an MCMC while reducing expensive simulations by ~100× and CPU cost by up to ~70×. This approach enables efficient inference in higher-dimensional EoR models and provides a general strategy for next-generation reionization constraints, including JWST and upcoming 21 cm data.
Abstract
Constraining the Epoch of Reionization (EoR) with physically motivated simulations is hampered by the high cost of conventional parameter inference. We present an efficient emulator-based framework that dramatically reduces this bottleneck for the photon-conserving semi-numerical code SCRIPT. Our approach combines (i) a reliable coarse-resolution MCMC to locate the high-likelihood region (exploiting the large-scale convergence of SCRIPT) with (ii) an adaptive, targeted sampling strategy to build a compact high-resolution training set for an artificial neural network based emulator of the model likelihood. With only $\approx 10^3$ high-resolution simulations, the trained emulators achieve excellent predictive accuracy ($R^2 \approx 0.97$--$0.99$) and, when embedded within an MCMC framework, reproduce posterior distributions from full high-resolution runs. Compared to conventional MCMC, our pipeline reduces the number of expensive simulations by a factor of $\sim 100$ and lowers total CPU cost by up to a factor of $\sim 70$, while retaining statistical fidelity. This computational speedup makes inference in much higher-dimensional models tractable (e.g., those needed to incorporate JWST and upcoming 21 cm datasets) and provides a general strategy for building efficient emulators for next generation of EoR constraints.
