COSMOS Spectroscopic Redshift Compilation (First Data Release): 488k Redshifts Encompassing Two Decades of Spectroscopy
Ali Ahmad Khostovan, Jeyhan S. Kartaltepe, Mara Salvato, Olivier Ilbert, Caitlin M. Casey, Hiddo Algera, Jacqueline Antwi-Danso, Andrew Battisti, Malte Brinch, Marcella Brusa, Antonello Calabro, Peter L. Capak, Nima Chartab, Olivia R. Cooper, Isa G. Cox, Behnam Darvish, Nicole E. Drakos, Andreas L. Faisst, Matthew R. George, Ghassem Gozaliasl, Santosh Harish, Gunther Hasinger, Hossein Hatamnia, Angela Iovino, Shuowen Jin, Daichi Kashino, Anton M. Koekemoer, Ronaldo Laishram, Khee-Gan Lee, Jitrapon Lertprasertpong, Simon J. Lilly, Daizhong Liu, Daniel C. Masters, Bahram Mobasher, Tohru Nagao, Masato Onodera, Yingjie Peng, David B. Sanders, Ryan L. Sanders, Zahra Sattari, Nick Scoville, Ekta A. Shah, John D. Silverman, Nao Suzuki, Sina Taamoli, Masayuki Tanaka, Lidia A. M. Tasca, Sune Toft, Greta Toni, Benny Trakhtenbrot, Jonathan R. Trump, Mattia Vaccari, Francesco Valentino, Brittany N. Vanderhoof, John R. Weaver, Min S. Yun, Jorge A. Zavala
TL;DR
The COSMOS Spec-$z$ Compilation DR1 unifies nearly two decades of spectroscopic redshifts across $138$ programs over a $10$ deg$^2$ field, enabling robust calibration and galaxy-population studies up to $z \sim 8$. It details a uniform data-processing pipeline that harmonizes quality flags, corrects astrometry to COSMOS2020, and flags duplicates, while re-deriving physical properties with redshifts fixed to spectroscopic values using both CIGALE and LePHARE. The work analyzes completeness, stellar-mass distributions, and the demographics of quiescent, star-forming, and bursty galaxies, and demonstrates practical uses in photo-$z$ validation and SOM-based gap analysis of spectroscopic coverage. As a living resource, it sets the stage for future data releases that will incorporate JWST, Euclid, Roman, and ground-based surveys, expanding coverage to fainter populations and enabling detailed environmental studies of large-scale structure.
Abstract
We present the COSMOS Spectroscopic Redshift Compilation encompassing ~ 20 years of spectroscopic redshifts within a 10 deg$^2$ area centered on the 2 deg$^2$ COSMOS legacy field. This compilation contains 487,666 redshifts of 266,284 unique objects from 138 individual observing programs up to $z \sim 8$ with median stellar mass $\sim 10^{8.4}$ to $10^{10}$ M$_\odot$ (redshift dependent). Rest-frame $NUVrJ$ colors and SFR -- stellar mass correlations show the compilation primarily contains low- to intermediate-mass star-forming and massive, quiescent galaxies at $z < 1.25$ and mostly low-mass bursty star-forming galaxies at $z > 2$. Sources in the compilation cover a diverse range of environments, including protoclusters such as ``Hyperion''. The full compilation is 50\% spectroscopically complete by $i \sim 23.4$ and $K_s \sim 21.6$ mag; however, this is redshift dependent. Spatially, the compilation is $>50$\% ($>30$\%) complete within the central (outer) region limited to $i < 24$ mag and $K_s < 22.5$ mag, separately. We demonstrate how the compilation can be used to validate photometric redshifts and investigate calibration metrics. By training self-organizing maps on COSMOS2020/Classic and projecting the compilation onto it, we find key galaxy subpopulations that currently lack spectroscopic coverage including $z < 1$ intermediate-mass quiescent galaxies and low-/intermediate-mass bursty star-forming galaxies, $z \sim 2$ massive quiescent galaxies, and $z > 3$ massive star-forming galaxies. This highlights how combining self-organizing maps with our compilation can provide guidance for future spectroscopic observations to get a complete spectroscopic view of galaxy populations. Lastly, the compilation will undergo periodic data releases that incorporate new spectroscopic redshift measurements, providing a lasting legacy resource for the community.
