Machine Learning-Driven Analysis of kSZ Maps to Predict CMB Optical Depth $τ$
Farshid Farhadi Khouzani, Abinash Kumar Shaw, Paul La Plante, Bryar Mustafa Shareef, Laxmi Gewali
TL;DR
The paper tackles the challenge of constraining the CMB optical depth $τ$ from the non-Gaussian kinetic Sunyaev-Zel'dovich (kSZ) signal of the Epoch of Reionization. It advances a machine learning pipeline that uses a Swin Transformer for multi-scale feature extraction from simulated kSZ maps and employs the Laplace Approximation to deliver principled uncertainty quantification, comparing post-hoc and online deployment modes. A seminumeric pipeline generates 1,000 map–$τ$ pairs across diverse reionization histories, enabling rigorous evaluation; the post-hoc LA model achieves high predictive accuracy (e.g., $R^2\approx0.93$) with well-calibrated uncertainties, outperforming the online LA variant. The results demonstrate a robust method to extract $τ$ from kSZ data and pave the way for applying these techniques to real CMB surveys, where reliable error bars are essential for cosmological inference.
Abstract
Upcoming measurements of the kinetic Sunyaev-Zel'dovich (kSZ) effect, which results from Cosmic Microwave Background (CMB) photons scattering off moving electrons, offer a powerful probe of the Epoch of Reionization (EoR). The kSZ signal contains key information about the timing, duration, and spatial structure of the EoR. A precise measurement of the CMB optical depth $τ$, a key parameter that characterizes the universe's integrated electron density, would significantly constrain models of early structure formation. However, the weak kSZ signal is difficult to extract from CMB observations due to significant contamination from astrophysical foregrounds. We present a machine learning approach to extract $τ$ from simulated kSZ maps. We train advanced machine learning models, including swin transformers, on high-resolution seminumeric simulations of the kSZ signal. To robustly quantify prediction uncertainties of $τ$, we employ the Laplace Approximation (LA). This approach provides an efficient and principled Gaussian approximation to the posterior distribution over the model's weights, allowing for reliable error estimation. We investigate and compare two distinct application modes: a post-hoc LA applied to a pre-trained model, and an online LA where model weights and hyperparameters are optimized jointly by maximizing the marginal likelihood. This approach provides a framework for robustly constraining $τ$ and its associated uncertainty, which can enhance the analysis of upcoming CMB surveys like the Simons Observatory and CMB-S4.
