Optimizing Deep Learning Photometric Redshifts for the Roman Space Telescope with HST/CANDELS
Ashod Khederlarian, Brett H. Andrews, Jeffrey A. Newman, Tianqing Zhang, Biprateep Dey
TL;DR
This work addresses the challenge of obtaining accurate photometric redshifts for the Roman Space Telescope's deep, high-redshift imaging by benchmarking deep learning approaches on HST CANDELS data. It compares a photometry-based baseline, a fully-supervised CNN, a self-supervised contrastive model, and introduces PITA, a semi-supervised framework that jointly optimizes contrastive, color, and redshift losses to leverage unlabeled data. PITA consistently yields the best photo-$z$ performance across metrics and magnitude ranges, with strong scalability as labeled data grows, while self-supervised pretraining alone underperforms at higher redshifts. The findings suggest that for Roman, semi-supervised, multi-task learning that exploits abundant unlabeled imaging data and incorporates domain-informed color and redshift objectives will maximize photo-$z$ accuracy, especially for label-scarce faint populations. These results provide practical guidance for Roman data processing pipelines and motivate further exploration of multi-modal, foundation-model–style approaches with careful task design.
Abstract
Photometric redshifts (photo-$z$'s) will be crucial for studies of galaxy evolution, large-scale structure, and transients with the Nancy Grace Roman Space Telescope. Deep learning methods leverage pixel-level information from ground-based images to achieve the best photo-$z$'s for low-redshift galaxies, but their efficacy at higher redshifts with deep, space-based imaging remains largely untested. We used Hubble Space Telescope CANDELS optical and near-infrared imaging to evaluate fully-supervised, self-supervised, and semi-supervised deep learning photo-$z$ algorithms out to $z\sim3$. Compared to template-based and classical machine learning photometry methods, the fully-supervised and semi-supervised models achieved better performance. Our new semi-supervised model, PITA (Photo-$z$ Inference with a Triple-loss Algorithm), outperformed all others by learning from unlabeled and labeled data through a three-part loss function that incorporates images and colors for all objects as well as redshifts when available. PITA produces a latent space that varies smoothly in magnitude, color, and redshift, resulting in the best photo-$z$ performance even when the redshift training set was significantly reduced. In contrast, the self-supervised approach produced a latent space with significant color and redshift fluctuations that hindered photo-$z$ inference. Looking forward to Roman, we recommend using semi supervised deep learning to take full advantage of the information contained in the hundreds of millions of high-resolution images and color measurements, together with the limited redshift measurements available, to achieve the most accurate photo-$z$ estimates for both faint and bright sources.
