287,872 Supermassive Black Holes Masses: Deep Learning Approaching Reverberation Mapping Accuracy
Yuhao Lu, HengJian SiTu, Jie Li, Yixuan Li, Yang Liu, Wenbin Lin, Yu Wang
TL;DR
This work develops QuasarSpecNet, a physically informed encoder–decoder autoencoder trained on RM-based black hole masses from the SDSS-RM sample to infer SMBH masses from single-epoch SDSS spectra. The model jointly reconstructs spectra and predicts RM-calibrated masses, enforcing a latent space that captures global virial features while preserving spectral detail via skip connections. Applied to 287,872 quasars across z ≈ 0–4, it delivers masses with RMSE ≈ 0.058 dex (≈14% relative) and R^2 ≈ 0.91 relative to RM masses, outperforming traditional single-line virial estimators and providing mass estimates where those methods fail. The resulting catalog enables robust population analyses, including a differential mass function with a break near 3×10^8 M⊙ and insights into broad vs. narrow line quasars, redshift distribution, and high-redshift limitations, while highlighting the need for infrared data to extend RM-calibrated accuracy to z > 4.
Abstract
We present a population-scale catalogue of 287,872 supermassive black hole masses with high accuracy. Using a deep encoder-decoder network trained on optical spectra with reverberation-mapping (RM) based labels of 849 quasars and applied to all SDSS quasars up to $z=4$, our method achieves a root-mean-square error of $0.058$\,dex, a relative uncertainty of $\approx 14\%$, and coefficient of determination $R^{2}\approx0.91$ with respect to RM-based masses, far surpassing traditional single-line virial estimators. Notably, the high accuracy is maintained for both low ($<10^{7.5}\,M_\odot$) and high ($>10^{9}\,M_\odot$) mass quasars, where empirical relations are unreliable.
