Learning the Universe: Cosmological and Astrophysical Parameter Inference with Galaxy Luminosity Functions and Colours
Christopher C. Lovell, Tjitske Starkenburg, Matthew Ho, Daniel Anglés-Alcázar, Romeel Davé, Austen Gabrielpillai, Kartheik Iyer, Alice E. Matthews, William J. Roper, Rachel Somerville, Laura Sommovigo, Francisco Villaescusa-Navarro
TL;DR
We introduce a simulation-based inference framework to jointly constrain cosmological parameters $Ω_m$ and $σ_8$ and astrophysical feedback from forward-modelled galaxy luminosity functions and colours using the CAMELS hydrodynamic simulations. The forward model (Synthesizer) computes rest-frame UV–NIR emission with two SPS models and a simple dust attenuation scheme, enabling rest-frame photometry across multiple bands and redshifts; a neural posterior estimation (LtU-ILI SBI) then infers the desired parameters from LF and colour distributions. We find that $Ω_m$ is tightly constrained by colours, while $σ_8$ benefits most from combining LFs and colours and from higher redshift information (notably $z=2$). Generalization across galaxy formation models is limited due to divergent subgrid physics and a simplistic forward model, underscoring the need for more flexible emission modelling and larger, higher-fidelity simulations. The generated photometric catalogs are public, illustrating a promising direction for cosmology- with-baryons analyses that complement traditional clustering probes.
Abstract
We perform the first direct cosmological and astrophysical parameter inference from the combination of galaxy luminosity functions and colours using a simulation based inference approach. Using the Synthesizer code we simulate the dust attenuated ultraviolet-near infrared stellar emission from galaxies in thousands of cosmological hydrodynamic simulations from the CAMELS suite, including the Swift-EAGLE, IllustrisTNG, Simba & Astrid galaxy formation models. For each galaxy we calculate the rest-frame luminosity in a number of photometric bands, including the SDSS $\textit{ugriz}$ and GALEX FUV & NUV filters; this dataset represents the largest catalogue of synthetic photometry based on hydrodynamic galaxy formation simulations produced to date, totalling >200 million sources. From these we compile luminosity functions and colour distributions, and find clear dependencies on both cosmology and feedback. We then perform simulation based (likelihood-free) inference using these distributions to constrain $Ω_m$, $σ_8$, and four parameters controlling the strength of stellar and AGN feedback. Both colour distributions and luminosity functions provide complementary information on certain parameters when performing inference. We achieve constraints on the stellar feedback parameters, as well as $Ω_m$ and $σ_8$. The latter is attributable to the fact that the photometry encodes the star formation-metal enrichment history of each galaxy; galaxies in a universe with a higher $σ_8$ tend to form earlier and have higher metallicities, which leads to redder colours. We find that a model trained on one galaxy formation simulation generalises poorly when applied to another, and attribute this to differences in the subgrid prescriptions, and lack of flexibility in our emission modelling. The photometric catalogues are publicly available at: https://camels.readthedocs.io/
