AppleCiDEr II: SpectraNet -- A Deep Learning Network for Spectroscopic Data
Maojie Xu, Argyro Sasli, Alexandra Junell, Felipe Fontinele Nunes, Yu-Jing Qin, Christoffer Fremling, Sam Rose, Theophile Jegou Du Laz, Benny Border, Antoine Le Calloch, Sushant Sharma Chaudhary, Hailey Markoff, Avyukt Raghuvanshi, Nabeel Rehemtulla, Jesper Sollerman, Yashvi Sharma, Niharika Sravan, Judy Adler, Tracy X. Chen, Richard Dekany, Reed Riddle, Mansi M. Kasliwal, Matthew J. Graham, Michael W. Coughlin
TL;DR
SpectraNet addresses the need for robust spectral classification of time-domain transients by learning multiscale spectral representations through a CNN architecture integrated into the AppleCiDEr multimodal pipeline. It combines rest-frame spectral preprocessing, a redshift regression head, and a five-block multi-scale CNN with a 1×1 projection, optimized with class-balanced focal loss and training techniques like EMA and mixed precision. The approach yields state-of-the-art classification performance and precise redshift estimates, with strong generalization to narrower wavelength coverage as demonstrated on NGPS data. This work enables real-time, spectroscopically informed transient classification and paves the way for LSST-era deployment with domain adaptation and evolution-aware extensions.
Abstract
Time-domain surveys such as the Zwicky Transient Facility (ZTF) have opened a new frontier in the discovery and characterization of transients. While photometric light curves provide broad temporal coverage, spectroscopic observations remain crucial for physical interpretation and source classification. However, existing spectral analysis methods -- often reliant on template fitting or parametric models -- are limited in their ability to capture the complex and evolving spectra characteristic of such sources, which are sometimes only available at low resolution. In this work, we introduce SpectraNet, a deep convolutional neural network designed to learn robust representations of optical spectra from transients. Our model combines multi-scale convolution kernels and multi-scale pooling to extract features from preprocessed spectra in a hierarchical and interpretable manner. We train and validate SpectraNet on low-resolution time-series spectra obtained from the Spectral Energy Distribution Machine (SEDM) and other instruments, demonstrating state-of-the-art performance in classification. Furthermore, in redshift prediction tasks, SpectraNet achieves a root mean squared relative redshift error of 0.02, highlighting its effectiveness in precise regression tasks as well.
