COSMOS-Web: Estimating Physical Parameters of Galaxies Using Self-Organizing Maps
Fatemeh Abedini, Ghassem Gozaliasl, Akram Hasani Zonoozi, Atousa Kalantari, Maarit Korpi-Lagg, Olivier Ilbert, Hollis Akins, Natalie Allen, Rafael Arango-Toro, Caitlin Casey, Nicole Drakos, Andreas Faisst, Carter Flayhart, Maximilien Franco, Hosein Haghi, Aryana Haghjoo, Santosh Harish, Hossein Hatamnia, Jeyhan Kartaltepe, Ali Khostovan, Anton Koekemoer, Vasily Kokorev, Rebecca Larson, Gavin Leroy, Daizhong Liu, Henry McCracken, Jed McKinney, Nicolas McMahon, Wilfried Mercier, Bahram Mobasher, Sophie Newman, Louise Paquereau, Jason Rhodes, Brant Robertson, Sogol Sanjaripour, Marko Shuntov, Sina Taamoli, Sune Toft, Francesco Valentino, Eleni Vardoulaki, John Weaver
TL;DR
This work demonstrates that Self-Organizing Maps can infer key galaxy physical parameters—$z$, $M_*$, $ ext{SFR}$, $ ext{sSFR}$, and age$_{mw}$—from multiband photometry in COSMOS-Web, by training on both HORIZON-AGN mocks and CW data. The authors introduce a covariate-shift alignment to robustly transfer the SOM from simulation to observation, and they evaluate performance using NMAD, RMSE, and Pearson $r$ across redshift bins. On HZ-AGN, redshift, mass, and SFR predictions are accurate with strong correlations, while CW predictions show more degeneracy and scatter, particularly for redshift and age, though stellar mass remains relatively well constrained. When applying the HZ-AGN SOM to CW data, the covariate-alignment approach improves consistency, indicating SOMs can be a fast, interpretable alternative or complement to SED fitting for future large-volume surveys that include JWST bands. Overall, the study highlights both the promise and the challenges of SOM-based galaxy parameter estimation in the era of JWST-based photometry.
Abstract
The COSMOS-Web survey, with its unparalleled combination of multiband data, notably, near-infrared imaging from JWST's NIRCam (F115W, F150W, F277W, and F444W), provides a transformative dataset down to $\sim28$ mag (F444W) for studying galaxy evolution. In this work, we employ Self-Organizing Maps (SOMs), an unsupervised machine learning method, to estimate key physical parameters of galaxies -- redshift, stellar mass, star formation rate (SFR), specific SFR (sSFR), and age -- directly from photometric data out to $z=3.5$. SOMs efficiently project high-dimensional galaxy color information onto 2D maps, showing how physical properties vary among galaxies with similar spectral energy distributions. We first validate our approach using mock galaxy catalogs from the HORIZON-AGN simulation, where the SOM accurately recovers the true parameters, demonstrating its robustness. Applying the method to COSMOS-Web observations, we find that the SOM delivers robust estimates despite the increased complexity of real galaxy populations. Performance metrics ($σ_{\mathrm{NMAD}}$ typically between $0.1$--$0.3$, and Pearson correlation between $0.7$ and $0.9$) confirm the precision of the method, with $\sim$ $70\%$ of predictions within 1$σ$ dex of reference values. Although redshift estimation in COSMOS-Web remains challenging (median $σ_{\mathrm{NMAD}} = 0.04$), the overall success of the highlights its potential as a powerful and interpretable tool for galaxy parameter estimation.
