Bridging Theory and Data: Correcting Nuclear Mass Models with Interpretable Machine Learning

Yanhua Lu; Tianshuai Shang; Pengxiang Du; Jian Li; Haozhao Liang

Bridging Theory and Data: Correcting Nuclear Mass Models with Interpretable Machine Learning

Yanhua Lu, Tianshuai Shang, Pengxiang Du, Jian Li, Haozhao Liang

Abstract

Nuclear mass prediction is one of the core issues in nuclear physics research, yet it faces the challenge of small-sample datasets with high complexity. This study introduces the Kolmogorov-Arnold Network (KAN) into the refinement of nuclear mass models, proposing an efficient and interpretable solution. By constructing the KAN-WS4 hybrid model, the prediction accuracy is significantly improved (the root mean square error is reduced from 0.3 MeV to 0.16 MeV). Furthermore, leveraging the intrinsic interpretability of KAN, feature importance analysis reveals that the proton number is the most critical factor influencing residuals, indicating potential systematic biases in proton-related terms within existing theoretical models. The method's generality is demonstrated across five mass models. This study shows that KAN provides a novel approach to small-sample, high-complexity scientific problems. Its interpretability facilitates the data-driven discovery of physical laws, promising broad applicability to key nuclear physics issues.

Bridging Theory and Data: Correcting Nuclear Mass Models with Interpretable Machine Learning

Abstract

Paper Structure (6 sections, 4 equations, 4 figures, 1 table)

This paper contains 6 sections, 4 equations, 4 figures, 1 table.

introduction
THEORETICAL FRAMEWORK
Kolmogorov-Arnold network
Small-sample datasets with high complexity
Results and discussion
SUMMARY AND PROSPECTS

Figures (4)

Figure 1: Comparison between (a) traditional machine learning frameworks (Multi-Layer Perceptron, MLP) and (b) Kolmogorov-Arnold Networks (KAN).
Figure 2: Differences (in MeV) between the nuclear masses predicted by KAN models and the experimental nuclear masses. (a) Data distributions of the training set and test set; (b) all dataset: (c) training set; (d) test set.
Figure 3: The training and test mass differences $\Delta M$ (in MeV) of the KAN model (red dots) with the WS4 nuclear mass model (black squares) as a reference. The shade (white) regions indicate the training (test) areas. (a) The Sn isotopic chain; (b) The Pb isotopic chain; (c) $N=82$ isotonic chain; (d) $N=126$ isotonic chain.
Figure 4: (a) Framework diagram of a KAN trained on the nuclear mass residual dataset, and (b) the corresponding analysis of feature importance.

Bridging Theory and Data: Correcting Nuclear Mass Models with Interpretable Machine Learning

Abstract

Bridging Theory and Data: Correcting Nuclear Mass Models with Interpretable Machine Learning

Authors

Abstract

Table of Contents

Figures (4)