Blazar classification from multi-wavelength data using Deep Learning
Saqlain Afroz, Titir Mukherjee, Raj Prince
TL;DR
The paper tackles blazar subclassification by training a deep feed-forward neural network to distinguish BL Lac objects from FSRQs using multi-wavelength data sourced from 4FGL-DR4, ROMA-BZCAT, and 3LAC. It compares two datasets: a four-feature setup and an expanded seven-feature setup, evaluated with five-fold cross-validation and 700 epochs per fold, achieving high accuracy (≈0.92 and ≈0.91, respectively) and strong ROC/PR performance with an AUC around 0.977. The study demonstrates that even a comparatively simple architecture can leverage cross-band information to classify BCUs, while highlighting the dependence on complete multi-wavelength coverage and proposing future interpretability analyses. The workflow and results support scalable, data-driven blazar population studies and can inform follow-up observations and CTA-era surveys by refining blazar catalogs through multi-wavelength machine learning.
Abstract
The launch of the Fermi-LAT telescope has revolutionized gamma-ray astronomy by detecting over 7,000 gamma-ray emitting objects. A major fraction of the objects are blazars of known type, and a similar fraction of objects were classified as blazars of uncertain types (BCUs). Apart from that, some of the objects were found to be unassociated with any other classes, and no information is available in other wavebands. These types of objects are classified as unassociated objects in the Fermi catalog. To classify the unassociated objects into known categories and BCUs into a known type of blazar, numerous efforts have been made using the machine learning approach. The ideal way of classification would be to have multi-wavelength temporal and spectral information, which is nearly impossible to have for this number of objects in the near future. In this paper, we focus on classifying BCUs into other types of blazars, such as FSRQs and BL Lacs. For this purpose, we have developed for the first time a deep feed-forward Artificial Neural Network (ANN) to classify them, using multi-wavelength data. The complete understanding of blazars can only be known through multi-wavelength observation, and hence, we begin with four input parameters that cover broadband information (radio, optical, X-ray fluxes, and redshift) to train the neural network, and then extend the framework by including additional parameters to examine their impact on the outcome.
