Estimating stellar atmospheric parameters and elemental abundances using fully connected residual network
Shuo Li, Yin-Bi Li, A-Li Luo, Jun-Chao Liang, Hai-Ling Lu, Hugh R. A. Jones
TL;DR
This work tackles the challenge of inferring stellar atmospheric parameters and elemental abundances from ultra-low-resolution spectra ($R\sim200$) by training a Fully Connected Residual Network (FCResNet) on LAMOST spectra degraded to CSST-like resolution with APOGEE labels. FCResNet combines a simple fully connected architecture with residual connections to enable stable training and high predictive accuracy, outperforming KNN, XGBoost, SVR, and CNN in predicting $T_ ext{eff}$, $\log g$, [Fe/H], [C/Fe], [N/Fe], and [Mg/Fe], while remaining computationally efficient (42 s for 10^6 spectra) and lightweight (348 KB). The model demonstrates robust performance across varying $S/N_g$, strong agreement with APOGEE and PARSEC isochrones in Kiel diagrams, and interpretable spectral feature usage via SHAP (e.g., MgI b at ~5184 Å). The results indicate FCResNet is a practical, scalable tool for CSST's large-volume ultra-low-resolution data, with potential to extend to broader abundance analyses and guiding future survey pipelines.
Abstract
Stellar atmospheric parameters and elemental abundances are traditionally determined using template matching techniques based on high-resolution spectra. However, these methods are sensitive to noise and unsuitable for ultra-low-resolution data. Given that the Chinese Space Station Telescope (CSST) will acquire large volumes of ultra-low-resolution spectra, developing effective methods for ultra-low-resolution spectral analysis is crucial. In this work, we investigated the Fully Connected Residual Network (FCResNet) for simultaneously estimating atmospheric parameters ($T_\text{eff}$, $\log g$, [Fe/H]) and elemental abundances ([C/Fe], [N/Fe], [Mg/Fe]). We trained and evaluated FCResNet using CSST-like spectra (\textit{R} $\sim$ 200) generated by degrading LAMOST spectra (\textit{R} $\sim$ 1,800), with reference labels from APOGEE. FCResNet significantly outperforms traditional machine learning methods (KNN, XGBoost, SVR) and CNN in prediction precision. For spectra with g-band signal-to-noise ratio greater than 20, FCResNet achieves precisions of 78 K, 0.15 dex, 0.08 dex, 0.05 dex, 0.10 dex, and 0.05 dex for $T_\text{eff}$, $\log g$, [Fe/H], [C/Fe], [N/Fe] and [Mg/Fe], respectively, on the test set. FCResNet processes one million spectra in only 42 seconds while maintaining a simple architecture with just 348 KB model size. These results suggest that FCResNet is a practical and promising tool for processing the large volume of ultra-low-resolution spectra that will be obtained by CSST in the future.
