Hydrogen intensity mapping with MeerKAT: Preserving cosmological signal by optimising contaminant separation
Isabella P. Carucci, José L. Bernal, Steven Cunnington, Mario G. Santos, Jingying Wang, José Fonseca, Keith Grainge, Melis O. Irfan, Yichao Li, Alkistis Pourtsidou, Marta Spinelli, Laura Wolz
TL;DR
This work tackles the critical problem of foreground removal in HI intensity mapping by evaluating PCA/SVD-based pipelines and introducing a multiscale PCA (mPCA) framework to preserve cosmological signal while suppressing contaminants. Using MeerKAT L-band data at z≈0.4 cross-correlated with WiggleZ galaxies, the authors demonstrate that cleaning contaminants within a conservative footprint and treating large and small angular scales independently yields robust detections with minimal signal loss, without requiring transfer-function corrections. The study shows that mPCA outperforms traditional PCA in cross-correlation amplitude, stability across k-scales, and variance reduction, marking a key methodological advance for upcoming MeerKAT and SKAO HI intensity mapping efforts. These results enhance confidence in using HI intensity maps to probe large-scale structure and demonstrate the practical viability of optimized, scale-aware foreground separation for future cosmological analyses.
Abstract
Removing contaminants is a delicate, yet crucial step in neutral hydrogen (HI) intensity mapping and often considered the technique's greatest challenge. Here, we address this challenge by analysing HI intensity maps of about $100$ deg$^2$ at redshift $z\approx0.4$ collected by the MeerKAT radio telescope, an SKA Observatory (SKAO) precursor, with a combined 10.5-hour observation. Using unsupervised statistical methods, we removed the contaminating foreground emission and systematically tested, step-by-step, some common pre-processing choices to facilitate the cleaning process. We also introduced and tested a novel multiscale approach: the data were redundantly decomposed into subsets referring to different spatial scales (large and small), where the cleaning procedure was performed independently. We confirm the detection of the HI cosmological signal in cross-correlation with an ancillary galactic data set, without the need to correct for signal loss. In the best set-up we achieved, we were able to constrain the HI distribution through the combination of its cosmic abundance ($Ω_{HI}$) and linear clustering bias ($b_{HI}$) up to a cross-correlation coefficient ($r$). We measured $Ω_{HI}b_{HI}r = [0.93 \pm 0.17]\,\times\,10^{-3}$ with a $\approx6σ$ confidence, which is independent of scale cuts at both edges of the probed scale range ($0.04 \lesssim k \lesssim 0.3 \,h$ Mpc$^{-1}$), corroborating its robustness. Our new pipeline has successfully found an optimal compromise in separating contaminants without incurring a catastrophic signal loss. This development instills an added degree of confidence in the outstanding science we can deliver with MeerKAT on the path towards HI intensity mapping surveys with the full SKAO.
