Table of Contents
Fetching ...

Preliminary Report on Mantis Shrimp: a Multi-Survey Computer Vision Photometric Redshift Model

Andrew Engel, Gautham Narayan, Nell Byler

TL;DR

This work tackles photometric redshift estimation in the era of large, multi-survey astronomical data by proposing Mantis Shrimp, a multi-modal CNN that ingests nine-band images spanning GALEX UV, Pan-STARRS optical, and UnWISE IR data. The authors cast redshift estimation as a $C=200$-class classification with $\delta c=0.005$ over $z\in[0,1]$, augment the network with line-of-sight dust extinction, and train an adapted ResNet50 for 85 epochs on a representative subset of $N_{ ext{eff}}=2.5\times10^5$ from a total sample of $N=4.2\times10^6$, using luptitude-like scaling and a single fused input tensor. They apply SHAP-based interpretability to quantify each band’s contribution (MM-SHAP) and assess calibration with PIT, linking modality importance to physical features in the spectral energy distribution. While the model currently falls short of the best optical+IR benchmarks, it demonstrates the feasibility of multi-survey image fusion for photo-z, offers physics-grounded insights into band prioritization across redshift, and paves the way for data-efficient training and future fusion strategies. The work thus provides a practical pathway to leveraging heterogeneous astronomical data for scalable, interpretable photometric redshift estimation with potential broad impact on upcoming survey analyses.

Abstract

The availability of large, public, multi-modal astronomical datasets presents an opportunity to execute novel research that straddles the line between science of AI and science of astronomy. Photometric redshift estimation is a well-established subfield of astronomy. Prior works show that computer vision models typically outperform catalog-based models, but these models face additional complexities when incorporating images from more than one instrument or sensor. In this report, we detail our progress creating Mantis Shrimp, a multi-survey computer vision model for photometric redshift estimation that fuses ultra-violet (GALEX), optical (PanSTARRS), and infrared (UnWISE) imagery. We use deep learning interpretability diagnostics to measure how the model leverages information from the different inputs. We reason about the behavior of the CNNs from the interpretability metrics, specifically framing the result in terms of physically-grounded knowledge of galaxy properties.

Preliminary Report on Mantis Shrimp: a Multi-Survey Computer Vision Photometric Redshift Model

TL;DR

This work tackles photometric redshift estimation in the era of large, multi-survey astronomical data by proposing Mantis Shrimp, a multi-modal CNN that ingests nine-band images spanning GALEX UV, Pan-STARRS optical, and UnWISE IR data. The authors cast redshift estimation as a -class classification with over , augment the network with line-of-sight dust extinction, and train an adapted ResNet50 for 85 epochs on a representative subset of from a total sample of , using luptitude-like scaling and a single fused input tensor. They apply SHAP-based interpretability to quantify each band’s contribution (MM-SHAP) and assess calibration with PIT, linking modality importance to physical features in the spectral energy distribution. While the model currently falls short of the best optical+IR benchmarks, it demonstrates the feasibility of multi-survey image fusion for photo-z, offers physics-grounded insights into band prioritization across redshift, and paves the way for data-efficient training and future fusion strategies. The work thus provides a practical pathway to leveraging heterogeneous astronomical data for scalable, interpretable photometric redshift estimation with potential broad impact on upcoming survey analyses.

Abstract

The availability of large, public, multi-modal astronomical datasets presents an opportunity to execute novel research that straddles the line between science of AI and science of astronomy. Photometric redshift estimation is a well-established subfield of astronomy. Prior works show that computer vision models typically outperform catalog-based models, but these models face additional complexities when incorporating images from more than one instrument or sensor. In this report, we detail our progress creating Mantis Shrimp, a multi-survey computer vision model for photometric redshift estimation that fuses ultra-violet (GALEX), optical (PanSTARRS), and infrared (UnWISE) imagery. We use deep learning interpretability diagnostics to measure how the model leverages information from the different inputs. We reason about the behavior of the CNNs from the interpretability metrics, specifically framing the result in terms of physically-grounded knowledge of galaxy properties.
Paper Structure (25 sections, 2 equations, 10 figures, 3 tables)

This paper contains 25 sections, 2 equations, 10 figures, 3 tables.

Figures (10)

  • Figure 1: Shapley (left) and MM-SHAP (right) values for each band, averaged across the samples in each redshift bin. We find that the model prioritizes information from the three instruments differently, and that prioritization changes as a function of redshift. This is inline with our understanding of the underlying physics, see appendix \ref{['appendix:physics']} for a review.
  • Figure 2: Intuition about which channels should be important when. Top Left. A representative galaxy SED (taken from the atlas in Brown14_galTemplates) is plot at that galaxy's rest-frame, with the photometric filters used in this work overlaid. Additionally, important breaks in the galaxy SED are identified with vertical black lines. Lower left. We plot the same SED except redshifted to a value of 1.0 to visually represent how light incoming from a galaxy appears in our observation frame. The important breaks/phenomena have shifted with the spectra and now coincide in different parts of filter-space. Right. Finally, we plot each of the SED Break characteristic wavelength as a function of redshift. These curves pick out at which filters we expect the be important at each redshift. A description of these breaks are given in appendix \ref{['appendix:physics']}. Finally, note the filter regions are shown as uniform boxes for visual ease-- the actual filters have unique, bell shaped transmission curves.
  • Figure 3: Distribution of targets in redshift visualized by individual survey
  • Figure 4: Distribution of targets in redshift. The shape of the entire distribution is dominated by SDSS and DESI.
  • Figure 5: Distribution of targets In the sky. Plot as astrometric position on the night sky (RA, DEC). Dominated by the SDSS and DESI footprint, with some deep drilling wells from spectroscopic surveys showing as particularly bright plots. The southern sky south of DEC=$-30^\circ$ is excluded due to the PanSTARRS footprint. The sparse region of the northern sky spans the milky-way.
  • ...and 5 more figures