Advancing Physics Data Analysis through Machine Learning and Physics-Informed Neural Networks
Vasileios Vatellis
TL;DR
This paper benchmarks multiple ML approaches, including Physics-Informed Neural Networks (PINNs), on a binary classification task that separates experimentally viable parameter points from non-viable ones in a BGL-based model. XGBoost performs best as a fast initial filter, while standard neural networks and PINNs achieve higher accuracy and better physics consistency at the cost of longer training times; the best PINN configuration balances precision against adherence to the physical constraints. The study introduces a dual-output PINN architecture trained with a physics-informed loss, $\mathcal{L}_{\text{total}} = \mathcal{L}_{\text{classification}} + \lambda \mathcal{L}_{\text{physics}}$ (with $\lambda = 1$), and shows that incorporating physical constraints can improve interpretability and reliability. Overall, the work highlights the trade-off between computational efficiency and model sophistication in physics data analysis and outlines directions for future work on generalization, uncertainty quantification, and ensemble approaches.
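As a rough illustration of how the combined loss $\mathcal{L}_{\text{total}} = \mathcal{L}_{\text{classification}} + \lambda \mathcal{L}_{\text{physics}}$ could be evaluated for a dual-output model, the minimal sketch below uses plain Python. It rests on assumptions that the summary does not state: the classification term is taken to be binary cross-entropy on the first output, and the physics term is taken to be a mean squared error between the second output and a reference value of some physical quantity. The function names (`binary_cross_entropy`, `mean_squared_error`, `total_loss`) and the form of the physics term are hypothetical and may differ from the paper's actual implementation.

```python
import math


def binary_cross_entropy(probs, labels, eps=1e-12):
    """Mean binary cross-entropy between predicted probabilities and 0/1 labels."""
    total = 0.0
    for p, y in zip(probs, labels):
        # Clip to avoid log(0) for saturated predictions.
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(labels)


def mean_squared_error(pred, target):
    """Mean squared error; stands in for the physics term (an assumption)."""
    return sum((a - b) ** 2 for a, b in zip(pred, target)) / len(target)


def total_loss(class_probs, labels, physics_pred, physics_target, lam=1.0):
    """Return (L_total, L_classification, L_physics).

    L_total = L_classification + lam * L_physics, with lam = 1 by default
    to match the value quoted in the summary.
    """
    l_cls = binary_cross_entropy(class_probs, labels)
    l_phys = mean_squared_error(physics_pred, physics_target)
    return l_cls + lam * l_phys, l_cls, l_phys


# Hypothetical toy batch: two parameter points, one viable and one not.
loss, l_cls, l_phys = total_loss(
    class_probs=[0.9, 0.2],      # first output: predicted viability
    labels=[1, 0],
    physics_pred=[1.05, 0.95],   # second output: predicted physical quantity
    physics_target=[1.0, 1.0],   # reference value (illustrative only)
)
print(f"L_cls={l_cls:.4f}  L_phys={l_phys:.4f}  L_total={loss:.4f}")
```

With $\lambda = 1$ the two terms are weighted equally, so how much the physics constraint influences training depends on the relative scale of the two losses. Under these assumptions, a different $\lambda$ would simply rescale the physics term before the sum.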
Abstract
In an era increasingly focused on green computing and explainable AI, revisiting traditional approaches in theoretical and phenomenological particle physics is paramount. This project evaluates various machine learning (ML) algorithms (Nearest Neighbors, Decision Trees, Random Forest, AdaBoost, Naive Bayes, Quadratic Discriminant Analysis (QDA), and XGBoost) alongside standard neural networks and a novel Physics-Informed Neural Network (PINN) for physics data analysis. We apply these techniques to a binary classification task that determines whether simulated scenarios are experimentally viable, based on Higgs observables and essential model parameters. Through this analysis, we aim to showcase the capabilities and computational efficiency of each model in binary classification, thereby contributing to the ongoing discourse on integrating ML and Deep Neural Networks (DNNs) into physics research. Among the classical ML algorithms evaluated, XGBoost emerged as the preferred choice for its speed and effectiveness, especially in the initial stages of computation with limited datasets. Standard neural networks and PINNs achieved superior accuracy and adherence to physical laws, but they required more computational time. These findings underscore the trade-off between computational efficiency and model sophistication.
