statmorph-lsst: Quantifying and correcting morphological biases in galaxy surveys
Elizaveta Sazonova, Cameron R. Morgan, Michael Balogh, Matías Blaña, Carlos G. Bornancini, Darko Donevski, Alister Graham, Hector M. Hernandez Toledo, Benne W. Holwerda, Jeyhan S. Kartaltepe, Garreth Martin, William J. Pearson, Rossella Ragusa, Vicente Rodriguez-Gomez, Michael J. Rutkowski, Jose Antonio Vázquez-Mata, Rogier A. Windhorst
TL;DR
This work addresses how imaging quality biases morphologies in galaxy surveys and provides a comprehensive framework to quantify and correct these biases. By degrading a large, diverse local galaxy sample, the authors map the dependence of all statmorph and single-Sérsic metrics on resolution and depth, and derive empirical corrections via symbolic regression, including two new measures, A_X and St. They show that geometric measures are generally robust, while concentration-based bulge indicators and disturbance metrics are substantially biased by PSF, resolution, and signal-to-noise, explaining part of the apparent evolution seen in high-z studies. The study yields practical outcomes: the statmorph-lsst Python package, a 64,000-image augmentation dataset, and a set of correction functions to enable bias-aware analyses for Rubin LSST and multi-wavelength morphology studies, facilitating robust interpretation of galaxy structure across cosmic time.
Abstract
Quantitative morphology provides a key probe of galaxy evolution across cosmic time and environments. However, these metrics can be biased by changes in imaging quality - resolution and depth - either across the survey area or the sample. To prepare for the upcoming Rubin LSST data, we investigate this bias for all metrics measured by statmorph and single-component Sérsic fitting with Galfit. We find that geometrical measurements (ellipticity, axis ratio, Petrosian radius, and effective radius) are fairly robust at most depths and resolutions. Light concentration measurements ($C$, Gini, $M_{20}$) systematically decrease with resolution, leading low-mass or high-redshift bulge-dominated sources to appear indistinguishable from disks. Sérsic index $n$, while unbiased, suffers from a 20-40% uncertainty due to degeneracies in the Sérsic fit. Disturbance measurements ($A$, $A_S$, $D$) depend on signal-to-noise and are thus affected by noise and surface-brightness dimming. We quantify this dependence for each parameter, offer empirical correction functions, and show that the evolution in $C$ observed in JWST galaxies can be explained purely by observational biases. We propose two new measurements - isophotal asymmetry $A_X$ and substructure $St$ - that aim to resolve some of these biases. Finally, we provide a Python package statmorph-lsst implementing these changes and a full dataset that enables tests of custom functions (see text for links).
