The Challenge in Illuminating the Invisible: Constraining LyC Escape with Bayesian Modelling and Symbolic Regression
Amanda Stoffers, Sandro Tacchella, Charlotte Simmonds, Benjamin D. Johnson, Roberto Maiolino
TL;DR
The paper addresses the indirect challenge of constraining LyC escape during the Epoch of Reionization by applying Bayesian SED fitting with Prospector to local LyC-leaking analogues (LzLCS). It systematically tests multiple prior and dust-attenuation configurations, demonstrates robust recovery of $f_{ m esc}^{ m LyC}$ in most cases, and derives a median escape fraction around 1% with some systems up to 70%, revealing that extreme LyC leakage does not always coincide with extreme global stellar properties. A symbolic regression analysis calibrated on synthetic Prospector data yields a concise relation $\log_{10}(f_{ m esc}) = -3.59\beta - 9.45 \pm 0.30$ that captures $f_{ m esc}^{ m LyC}$ within uncertainties for the LzLCS subset, offering a practical estimator when full SED fitting is impractical. The study highlights both the potential of Bayesian SED modelling to constrain LyC leakage and the limitations imposed by nebular emission modelling and ISM geometry, while framing local analogues as a valuable bridge to understanding reionization-era galaxies. Overall, the work advances LyC diagnostics by combining rigorous inference with data-driven regression, informing how indirect tracers can be calibrated and applied in high-redshift contexts.
Abstract
Direct observations of Lyman continuum (LyC) radiation from galaxies during the Epoch of Reionization (EoR) are impeded by absorption in the intergalactic medium, requiring indirect methods to infer the escape fraction of ionizing photons ($f_{\rm esc}^{\rm LyC}$). One approach is to develop and validate such methods on local analogues of the high-redshift galaxies with directly detected LyC leakage. In this work, we constrain $f_{\rm esc}^{\rm LyC}$ using a Bayesian spectral energy distribution (SED) fitting framework built on Prospector, which incorporates a non-parametric star-formation history, a flexible dust attenuation curve, self-consistent nebular emission, and fiber aperture-loss corrections. Our methodology jointly fits broadband photometry and emission line fluxes. We apply six models to the Low-redshift LyC Survey (LzLCS), a sample of local galaxies with properties comparable to EoR galaxies, and evaluate them based on their ability to recover the observed LyC flux and their relative Bayesian evidence. The best-performing model is further assessed through a parameter recovery test, demonstrating that $f_{\rm esc}^{\rm LyC}$can be recovered within uncertainties. Building on these results, we present updated $f_{\rm esc}^{\rm LyC}$ estimates for the LzLCS sample, with a median of 0.3%, corresponding to very low leakage, and values reaching as high as 70%, with six of 64 galaxies having a cosmologically relevant $f_{\rm esc}^{\rm LyC}$ ($>5%$). Additionally, we present a revised UV $β$-slope vs $\log_{10}(f_\mathrm{esc}^\mathrm{LyC})$ relation, derived using symbolic regression with PySR trained on a synthetic dataset generated from our best-performing model: $\log_{10}(f_{\rm esc}^{\rm LyC}) = (-3.59β- 9.45) \, \pm \, 0.30$. The relation successfully reproduces the $f_{\rm esc}^{\rm LyC}$ obtained from full SED fitting of the LzLCS sample within uncertainties.
