Photometric classification of supernovae detected by the Zwicky Transient Facility using noise augmentation
A. Townsend, J. Nordin, M. Kowalski, S. Reusch, J. P. Anderson, E. C. Bellm, U. Burgaz, T. X. Chen, T. -W. Chen, G. Dimitriadis, L. Galbany, A. Goobar, M. J. Graham, M. Gromadzki, C. P. Gutiérrez, D. Hale, C. Inserra, M. Kasliwal, Y. -L. Kim, K. Maguire, F. J. Masci, T. E. Müller-Bravo, D. A. Perley, R. L. Riddle, M. Rigault, J. van Santen, S. Schulze, M. Smith, J. Sollerman, S. Yang
TL;DR
This work tackles the challenge of constructing a large photometric SN Ia sample from ZTF data by combining ParSNIP-based light-curve representations with a robust noise augmentation strategy that mimics fainter, higher-redshift observations. Through a two-stage pipeline—latent-feature extraction via ParSNIP and a gradient-boosted classifier—the authors achieve SN Ia recall exceeding 98% for favorable observing conditions and maintain high overall classification accuracy while controlling contamination. Real-time performance is demonstrated in NoiZTF, where the classifier correctly identified the majority of targets for spectroscopic follow-up, including rare SLSNe, despite modest detection counts. The approach is shown to generalize across brightness ranges, DR2 validation, and practical cosmology considerations, with clear pathways to LSST-era applications and further improvements via expanded peculiar-class training and survey simulations.
Abstract
Modern time-domain surveys, such as the Zwicky Transient Facility (ZTF), detect far more extragalactic transients than can be spectroscopically classified. Photometric classification offers a scalable alternative, enabling the identification of larger, fainter, and higher-redshift supernova samples suitable for applications such as Type Ia supernova (SN Ia) cosmology. We present a feature-based photometric classifier for SNe detected by ZTF, with the primary goal of constructing a photometric SN Ia sample for cosmological analyses. Our approach utilises the autoencoder architecture of ParSNIP (Boone 2021) to capture the intrinsic diversity of SN light curves. We trained the model on a spectroscopically classified ZTF SN sample, incorporating a realistic noise augmentation procedure that simulates the flux uncertainties of fainter sources. Light curve features were used to train a gradient-boosted decision tree classifier, implemented in both binary (SN Ia vs. non-Ia) and multi-class configurations. We validated our classifier on independent, fainter ZTF data with and without noise augmentation. To evaluate real-time performance, we also applied our classifier to live ZTF alerts and conducted a spectroscopic classification survey within the ePESSTO+ collaboration. We found that noise augmentation significantly improves classification performance, particularly for fainter sources. Our binary classifier achieves an SN Ia recall of (98.1 $\pm$ 0.4)%, averaged across five train-test splits. SN Ia recall exceeds 98% for events with a peak apparent magnitude up to 20 and more than 10 detections, and remains above 96% up to magnitude 20.5. Overall, 95% of sources were correctly classified in both binary and multi-class modes. Our classifier performs efficiently on real ZTF data and enables construction of a large photometric SN Ia sample for cosmology.
