QuasarNET: Human-level spectral classification and redshifting with Deep Neural Networks
Nicolas Busca, Christophe Balland
TL;DR
QuasarNET introduces a convolutional neural network that treats quasar spectral classification and redshift estimation as a feature-detection task, matching human-expert performance on BOSS data. The model detects seven emission-line features across multiple wavelength intervals, yielding purity and completeness around 99.5% for quasar selection and a redshift precision with Δv ~ 8 km/s and catastrophic failures < 0.2%. It extends to BAL quasar detection, achieving high BAL/non-BAL recognition while maintaining redshift reliability. Compared with automatic PCA-based classifiers, QuasarNET delivers superior sample quality with reduced need for visual inspection, and its training/inference efficiency supports application to upcoming surveys like DESI and 4MOST. The approach shows robust performance across redshift and motivates using QuasarNET outputs as priors for more precise spectral modeling.
Abstract
We introduce QuasarNET, a deep convolutional neural network that performs classification and redshift estimation of astrophysical spectra with human-expert accuracy. We pose these two tasks as a \emph{feature detection} problem: presence or absence of spectral features determines the class, and their wavelength determines the redshift, very much like human-experts proceed. When ran on BOSS data to identify quasars through their emission lines, QuasarNET defines a sample $99.51\pm0.03$\% pure and $99.52\pm0.03$\% complete, well above the requirements of many analyses using these data. QuasarNET significantly reduces the problem of line-confusion that induces catastrophic redshift failures to below 0.2\%. We also extend QuasarNET to classify spectra with broad absorption line (BAL) features, achieving an accuracy of $98.0\pm0.4$\% for recognizing BAL and $97.0\pm0.2$\% for rejecting non-BAL quasars. QuasarNET is trained on data of low signal-to-noise and medium resolution, typical of current and future astrophysical surveys, and could be easily applied to classify spectra from current and upcoming surveys such as eBOSS, DESI and 4MOST.
