Physics Event Classification Using Large Language Models
Cristiano Fanelli, James Giroux, Patrick Moran, Hemalata Nayak, Karthik Suresh, Eric Walter
TL;DR
The paper evaluates whether a Large Language Model (ChatGPT-3.5) can drive a physics-focused ML task under tight access constraints by organizing an 8-hour hackathon using a Streamlit web interface and AWS GPU compute to classify GlueX BCAL showers as neutrons or photons with 14 features across two phase-space regimes. The approach demonstrated near-perfect accuracy across teams, with one team achieving top performance while minimizing prompt usage, illustrating the viability of LLM-assisted, domain-specific ML workflows in experimental physics. It also documents infrastructure, scoring, and data-collection practices to support future prompt-engineering research and education within the AI4EIC program. Overall, the work highlights the practical potential and actionable pathways for integrating LLMs into physics data analysis and ML toolchains, while outlining rich opportunities for systematic studies of prompt strategies in domain tasks.
Abstract
The 2023 AI4EIC hackathon was the culmination of the third annual AI4EIC workshop at The Catholic University of America. This workshop brought together researchers from physics, data science and computer science to discuss the latest developments in Artificial Intelligence (AI) and Machine Learning (ML) for the Electron Ion Collider (EIC), including applications for detectors, accelerators, and experimental control. The hackathon, held on the final day of the workshop, involved using a chatbot powered by a Large Language Model, ChatGPT-3.5, to train a binary classifier neutrons and photons in simulated data from the \textsc{GlueX} Barrel Calorimeter. In total, six teams of up to four participants from all over the world took part in this intense educational and research event. This article highlights the hackathon challenge, the resources and methodology used, and the results and insights gained from analyzing physics data using the most cutting-edge tools in AI/ML.
