pop-cosmos: Forward modeling KiDS-1000 redshift distributions using realistic galaxy populations
Boris Leistedt, Hiranya V. Peiris, Anik Halder, Stephen Thorp, Daniel J. Mortlock, Arthur Loureiro, Justin Alsing, Gurjeet Jagwani, Madalina N. Tudorache, Sinan Deger, Joel Leja, Benedict Van den Bussche, Angus H. Wright, Shun-Sheng Li, Konrad Kuijken, Hendrik Hildebrandt
TL;DR
This work tackles the challenge of calibrating galaxy redshift distributions for Stage IV weak-lensing surveys by forward-modeling KiDS-1000 data through a combination of an empirically calibrated population model (pop-cosmos) and a data model learned from SKiLLS image simulations. By comparing pop-cosmos with the shark semi-analytic model under the same data model, it demonstrates that the choice of population can shift the tomographic redshift distributions by $\Delta z \sim 0.05$–$0.1$ in the edge bins, underscoring the importance of accurate color–redshift relations. The framework, trained on COSMOS2020 and SKiLLS, reproduces the observed data properties and provides a scalable, spectroscopic-calibration–independent path to $n(z)$ for KiDS-1000, with clear implications for Stage IV analyses. This forward-modeling approach offers a robust cross-check against spectroscopic calibrations and is extendable to full KiDS cosmology and future surveys where percent-level precision on cosmological parameters is demanded.
Abstract
The accuracy of the cosmological constraints from Stage~IV galaxy surveys will be limited by how well the galaxy redshift distributions can be inferred. We have addressed this challenging problem for the Kilo-Degree Survey (KiDS) cosmic shear sample by developing a forward-modeling framework with two main ingredients: (1) the \texttt{pop-cosmos} generative model for the evolving galaxy population, calibrated on \textit{Spitzer} IRAC $\textit{Ch.\,1}<26$ galaxies from COSMOS2020; and (2) a data model for noise and selection, machine-learned from the SURFS-based KiDS-Legacy-Like Simulations (SKiLLS). Applying KiDS tomographic binning to our synthetic photometric data, we infer redshift distributions in each of five bins directly from the population and data models, bypassing the need for spectroscopic reweighting. Keeping the data model fixed, we compare results using two different galaxy population models: \texttt{pop-cosmos}; and \texttt{shark}, the semi-analytic galaxy formation model used in SKiLLS. In the first ($0.1<z<0.3$) and last ($0.9<z<1.2$) tomographic bins we find systematic differences in the mean redshifts of $Δz\sim0.05$-$0.1$, comparable to the reported uncertainties from spectroscopic reweighting methods. This work paves the way for accurate redshift distribution calibration for Stage~IV surveys directly through forward modeling, thus providing an independent cross-check on spectroscopic-based calibrations which avoids their selection biases and incompleteness. We will use the \texttt{pop-cosmos} redshift distributions in an upcoming full KiDS cosmology reanalysis.
