A method to derive self-consistent NLTE astrophysical parameters for 4 million high-resolution 4MOST stellar spectra in half a day with invertible neural networks
Victor F. Ksoll, Nicholas Storm, Maria Bergemann, Katherine Lee, Ralf S. Klessen, R. Albarracín, Guillaume Guiglion, Gražina Tautvaišienė
TL;DR
The paper introduces a simulation-based NLTE inference framework using a conditional invertible neural network (cINN) trained on a large grid of NLTE Turbospectrum spectra to derive self-consistent stellar parameters and chemical abundances from high-resolution 4MOST-like spectra. The cINN yields full posterior distributions, enabling intrinsic uncertainty quantification and revealing degeneracies, while delivering orders-of-magnitude faster inference than traditional LTE approaches. On synthetic NLTE data at S/N ≈ 250 Å−1, the method achieves low biases and competitive scatter for Teff, log g, [Fe/H], and key abundance ratios, with uncertainties increasing at lower S/N and in metal-poor regimes. Validation against Gaia-ESO/PLATO/4MOST benchmark stars shows good agreement with independent TSFitPy results after bias calibration, supporting scalable NLTE analysis of millions of spectra. The approach promises practical impact for upcoming large surveys by enabling rapid, self-consistent, NLTE-informed stellar characterization and will be extended to more elements and dynamic S/N regimes, with plans for public release.
Abstract
Modern spectroscopic surveys obtain spectra for millions of stars. However, classical spectroscopic methods can often be computationally expensive, rendering them impractical for the analysis of large datasets. We introduce a novel simulation-based deep-learning approach for the efficient analysis of high-resolution stellar spectra to be obtained with the upcoming high-resolution 4MOST spectrograph. We used a suite of synthetic non-local thermodynamic equilibrium (NLTE) spectra generated with Turbospectrum to mimic 4MOST observations and trained a conditional invertible neural network (cINN) for the purpose of predicting self-consistently stellar surface parameters and chemical abundances. The cINN is a neural network architecture that estimates full posterior distributions for the target stellar properties, providing an intrinsic uncertainty estimate. We evaluated the predictive performance of the trained cINN model on both synthetic data and observed spectra of stars. We found that our new cINN trained on NLTE synthetic spectra is capable of recovering stellar parameters with average errors ($σ$) of $33$ K for $T_\mathrm{eff}$, $0.16$ dex for $\log(g)$, and $0.12$ dex for [Fe/H], $0.1$ dex for [Ca/Fe], $0.11$ for [Mg/Fe], and $0.51$ dex for [Li/Fe], respectively, at a signal to noise ratio of 250 per Angstrom. From the analysis of the observed spectra of Gaia-ESO / 4MOST / PLATO benchmark stars, we verified that our NLTE estimates for stellar parameters and abundances are consistent with results obtained with the independent code TSFitPy. We conclude that the NLTE cINN is robust and can, theoretically, evaluate 4 million high-resolution 4MOST spectra in less than a day, using GPU acceleration.
