Mapping the Galaxy Color-Redshift Relation: Optimal Photometric Redshift Calibration Strategies for Cosmology Surveys
Daniel Masters, Peter Capak, Daniel Stern, Olivier Ilbert, Mara Salvato, Samuel Schmidt, Giuseppe Longo, Jason Rhodes, Stephane Paltani, Bahram Mobasher, Henk Hoekstra, Hendrik Hildebrandt, Jean Coupon, Charles Steinhardt, Josh Speagle, Andreas Faisst, Adam Kalinich, Mark Brodwin, Massimo Brescia, Stefano Cavuoti
TL;DR
This work presents a data-driven framework using self-organizing maps (SOMs) to map the high-dimensional color space of Euclid-like photometry and assess how well current spectroscopic redshifts cover that space. By projecting COSMOS data into an 8-band, Euclid-like color space, the authors quantify the distribution ρ(⃗C), identify color-space regions lacking secure spectroscopic redshifts, and derive a formal sampling strategy to calibrate the color-redshift relation with minimal spectroscopy. They show that the mean redshift ⟨z⟩ of tomographic bins can be constrained with uncertainty Δ⟨z⟩ ≈ σ⟨z_i⟩/√c, guiding spectroscopic allocation to dense, high-uncertainty cells and projecting a feasible total of ~10–15k spectra for Euclid, depending on strategy. The approach also offers practical insights for identifying degeneracies, refining template priors, and planning calibration campaigns for Euclid, DES, LSST, and WFIRST.
Abstract
Calibrating the photometric redshifts of >10^9 galaxies for upcoming weak lensing cosmology experiments is a major challenge for the astrophysics community. The path to obtaining the required spectroscopic redshifts for training and calibration is daunting, given the anticipated depths of the surveys and the difficulty in obtaining secure redshifts for some faint galaxy populations. Here we present an analysis of the problem based on the self-organizing map, a method of mapping the distribution of data in a high-dimensional space and projecting it onto a lower-dimensional representation. We apply this method to existing photometric data from the COSMOS survey selected to approximate the anticipated Euclid weak lensing sample, enabling us to robustly map the empirical distribution of galaxies in the multidimensional color space defined by the expected Euclid filters. Mapping this multicolor distribution lets us determine where - in galaxy color space - redshifts from current spectroscopic surveys exist and where they are systematically missing. Crucially, the method lets us determine whether a spectroscopic training sample is representative of the full photometric space occupied by the galaxies in a survey. We explore optimal sampling techniques and estimate the additional spectroscopy needed to map out the color-redshift relation, finding that sampling the galaxy distribution in color space in a systematic way can efficiently meet the calibration requirements. While the analysis presented here focuses on the Euclid survey, similar analysis can be applied to other surveys facing the same calibration challenge, such as DES, LSST, and WFIRST.
