Application of Neural Networks for the Reconstruction of Supernova Neutrino Energy Spectra Following Fast Neutrino Flavor Conversions

Sajad Abbar; Meng-Ru Wu; Zewei Xiong

Application of Neural Networks for the Reconstruction of Supernova Neutrino Energy Spectra Following Fast Neutrino Flavor Conversions

Sajad Abbar, Meng-Ru Wu, Zewei Xiong

TL;DR

This work addresses fast flavor conversions in dense astrophysical neutrino environments by analyzing a realistic multi-energy gas where, under $\kappa \gg \omega$, all energies share a common survival probability determined by the energy-integrated spectrum. The authors train a physics-informed neural network (PINN) to predict the asymptotic outcomes from the initial energy-integrated moments and per-bin moments, leveraging domain knowledge through a targeted loss term and informative features such as the lepton-number crossing $\mu_c$. The PINN achieves errors as low as $\lesssim 6\%$ for electron-channel neutrino counts and $\lesssim 18\%$ for moments, outperforming a baseline NN and enabling practical incorporation of FFC physics into CCSN/NSM simulations. They also tackle full-spectrum reconstruction by employing a two-PINN scheme to respect conservation laws, achieving reasonable accuracy for typical SN accretion-phase spectra while highlighting tail-related challenges and opportunities for uncertainty quantification in future work.

Abstract

Neutrinos can undergo fast flavor conversions (FFCs) within extremely dense astrophysical environments such as core-collapse supernovae (CCSNe) and neutron star mergers (NSMs). In this study, we explore FFCs in a \emph{multi-energy} neutrino gas, revealing that when the FFC growth rate significantly exceeds that of the vacuum Hamiltonian, all neutrinos (regardless of energy) share a common survival probability dictated by the energy-integrated neutrino spectrum. We then employ physics-informed neural networks (PINNs) to predict the asymptotic outcomes of FFCs within such a multi-energy neutrino gas. These predictions are based on the first two moments of neutrino angular distributions for each energy bin, typically available in state-of-the-art CCSN and NSM simulations. Our PINNs achieve errors as low as $\lesssim6\%$ and $\lesssim 18\%$ for predicting the number of neutrinos in the electron channel and the relative absolute error in the neutrino moments, respectively.

Application of Neural Networks for the Reconstruction of Supernova Neutrino Energy Spectra Following Fast Neutrino Flavor Conversions

TL;DR

This work addresses fast flavor conversions in dense astrophysical neutrino environments by analyzing a realistic multi-energy gas where, under

, all energies share a common survival probability determined by the energy-integrated spectrum. The authors train a physics-informed neural network (PINN) to predict the asymptotic outcomes from the initial energy-integrated moments and per-bin moments, leveraging domain knowledge through a targeted loss term and informative features such as the lepton-number crossing

. The PINN achieves errors as low as

for electron-channel neutrino counts and

for moments, outperforming a baseline NN and enabling practical incorporation of FFC physics into CCSN/NSM simulations. They also tackle full-spectrum reconstruction by employing a two-PINN scheme to respect conservation laws, achieving reasonable accuracy for typical SN accretion-phase spectra while highlighting tail-related challenges and opportunities for uncertainty quantification in future work.

Abstract

and

for predicting the number of neutrinos in the electron channel and the relative absolute error in the neutrino moments, respectively.

Paper Structure (9 sections, 25 equations, 6 figures)

This paper contains 9 sections, 25 equations, 6 figures.

Introduction
FFCs in a multi-energy neutrino gas
Results of the simulations
Survival probability function
Applications of Neural Networks
The architecture of NNs
The NN's performance
Reconstruction of neutrino energy spectra
DISCUSSION AND OUTLOOK

Figures (6)

Figure 1: Comparison of the spanned range of spatially averaged survival probabilities for neutrinos with different $\omega$ (grey shaded area; see Eq. \ref{['eq:Psur-z']}), with the survival probability averaged over space and $\omega$ (black dashed curve; see Eq. \ref{['eq:Psur-zw']}), at the final time of the simulation for a system with $\omega\ll \lambda$. Also shown is the analytical prescription for $\langle P_{\rm sur} (\mu,\omega) \rangle_f$ (blue dotted curve) described in Eq. (\ref{['eq:sur2f']}). Note that the grey-shaded area basically overlaps with the black dashed line, implying that the survival probabilities are nearly independent of the neutrino energy.
Figure 2: Schematic architecture of our NNs. The green zone shows the implementation of the extra features, $\mu_c$, and $E_{RL}$, which are obtained through an extra layer of regression, using linear and logistic regressions, respectively. Here, $\mu_c$ is the crossing direction and $E_{RL}$ is a binary, which is 1 if the equilibrium occurs for $\mu_c\leq\mu$, and 0 otherwise. The blue zone represents energy-integrated inputs, while the orange zone displays inputs for specific energy bins. Note that the neutrino number densities in the particular energy bin must be smaller than the corresponding energy-integrated values. In our basic NN, referred to as the NN with no extra features, the NN only takes the inputs highlighted in Eq. (\ref{['eq:inputs']}). However and in our PINN, we provide our NN with the extra features $\mu_c$ and $E_{RL}$.
Figure 3: Performance evaluation of the PINN and the basic NN with no extra features. We present the relative absolute error in the output parameters, along with the relative error in the total number of neutrinos within the electron channel, $N_{{\nu_e} + {\bar{\nu}_e}}$. It is evident that the PINN can well outperform the basic NN with no extra features. Here, an epoch refers to a single pass through the entire training dataset during the training phase.
Figure 4: Performance evaluation of our PINN and the basic NN with no extra features (on the validation set) as a function of the number of neurons in the hidden layer. It is evident that the NNs have achieved their best performance on the validation set once $n_h\gtrsim 150$. The labels and NN models are the same as those in Fig. \ref{['fig:error']}.
Figure 5: Absolute relative error in the output of our PINN (red curve) vs the relative error in the number of neutrinos in electron channel, i.e., $N_{{\nu_e} + {\bar{\nu}_e}}$ (blue curve). The inclusion of a few thousand data points in the training set leads to the disappearance of error variations between the validation and training sets. Note that we do not display the absolute relative error in the training set. This is due to the presence of small $I's$ in the training set (which are removed from the test set), causing a significantly larger relative error. Hence, any direct comparisons between the absolute relative errors in the training and test sets would be unfair.
...and 1 more figures

Application of Neural Networks for the Reconstruction of Supernova Neutrino Energy Spectra Following Fast Neutrino Flavor Conversions

TL;DR

Abstract

Application of Neural Networks for the Reconstruction of Supernova Neutrino Energy Spectra Following Fast Neutrino Flavor Conversions

Authors

TL;DR

Abstract

Table of Contents

Figures (6)