Multiband neural network classification of ZTF light curves as LSST proxies
Tamás Szklenár, Attila Bódi, Róbert Szabó
TL;DR
This work develops a multiband neural network to classify periodic variable stars using phase-folded light curves from ZTF DR17 as a proxy for LSST data, incorporating the stars' periods as numerical inputs. The model uses two parallel CNNs for the $g$ and $r$ bands, whose image-based outputs are fused with a period-derived pathway to perform joint classification, achieving up to about 99% accuracy on several classes in testing. Data from OGLE-IV, Gaia DR3, and ZTF are cross-matched and augmented to balance five main variable-star types, with period estimation conducted in both single and multiband modes; results show multiband inputs significantly improve performance, particularly for Cepheids and long-period variables. The study demonstrates the feasibility of LSST-scale classification pipelines before LSST data arrive and outlines extensions to incorporate more bands (e.g., $i$, $g$, $r$, $i$, $z$, $y$) and multiband period-search techniques, aided by Gaia-based cross-matches and DP1-era data. The approach yields high-precision, scalable classifications essential for time-domain astronomy in upcoming large surveys.
Abstract
In this project we use data obtained by Zwicky Transient Facility to develop and test a neural-network-based, multiband classification algorithm to classify periodic variable stars (i.e. pulsating variable stars and eclipsing binaries). The aim is to utilize the algorithm on LSST data once they become available. Phase-folded light curve images and period information were used from five different variable star types: Classical and Type II Cepheids, δ Scuti stars, eclipsing binaries, and RR Lyrae stars. The data is taken from the 17th data release of ZTF, from which we used two passbands, g and r in this project. The periods were calculated from the raw data and this information was used as an additional numerical input in the neural network. For the training and testing process a supervised machine learning method was created, the neural network contains Convolutional Neural Networks concatenated with Fully Connected Layers. During the training-validation process the training accuracy reached 99% and the validation accuracy peaked at 95.6%. At the test classification phase three variable star types out of the 5 classes were classified with around 99% of accuracy, the other two also had very high accuracy, 89.6% and 93.6%.
