Table of Contents
Fetching ...

CoLLM: AI engineering toolbox for end-to-end deep learning in collider analyses

W. Esmail, A. Hammad, M. Nojiri

TL;DR

CoLLM presents a physics-aware, end-to-end framework that uses LLMs to generate executable collider-analysis code from plain-language specifications and to automate downstream deep learning, all within a reproducible, plug-and-play workflow. It couples a physics-constrained code-generation engine with automated DL pipelines (MLP, GNN, Transformer) and a LangChain-based orchestrator that enforces deterministic decoding and iterative error correction. The approach is validated on five benchmark processes, demonstrating executable analyses, validated observables, and DL classifiers, while highlighting comparative advantages over generic LLM code generation. The work aims to substantially reduce coding and technical overhead in collider analyses while preserving physical reliability, with public availability and a clear roadmap for future extensions such as ROOT-format support and adaptive graph constructions.

Abstract

Recent improvements in large language models have opened new opportunities for accelerating and automating scientific workflows. In parallel, modern collider analyses are becoming increasingly complex and demand substantial programming and deep learning expertise. \coll alleviates this workload by using pretrained large language models to generate physically consistent analysis code for event selection. Additionally, it automates subsequent deep learning analyses. To further reduce reliance on programming or deep learning experience, \coll provides a graphical user interface that allows users to perform end-to-end analyses through an interactive interface. The main motivation behind \coll is to lower the coding burden and simplify the technical complexity of collider analyses, which increasingly depend on sophisticated event selections and advanced deep learning methods.

CoLLM: AI engineering toolbox for end-to-end deep learning in collider analyses

TL;DR

CoLLM presents a physics-aware, end-to-end framework that uses LLMs to generate executable collider-analysis code from plain-language specifications and to automate downstream deep learning, all within a reproducible, plug-and-play workflow. It couples a physics-constrained code-generation engine with automated DL pipelines (MLP, GNN, Transformer) and a LangChain-based orchestrator that enforces deterministic decoding and iterative error correction. The approach is validated on five benchmark processes, demonstrating executable analyses, validated observables, and DL classifiers, while highlighting comparative advantages over generic LLM code generation. The work aims to substantially reduce coding and technical overhead in collider analyses while preserving physical reliability, with public availability and a clear roadmap for future extensions such as ROOT-format support and adaptive graph constructions.

Abstract

Recent improvements in large language models have opened new opportunities for accelerating and automating scientific workflows. In parallel, modern collider analyses are becoming increasingly complex and demand substantial programming and deep learning expertise. \coll alleviates this workload by using pretrained large language models to generate physically consistent analysis code for event selection. Additionally, it automates subsequent deep learning analyses. To further reduce reliance on programming or deep learning experience, \coll provides a graphical user interface that allows users to perform end-to-end analyses through an interactive interface. The main motivation behind \coll is to lower the coding burden and simplify the technical complexity of collider analyses, which increasingly depend on sophisticated event selections and advanced deep learning methods.
Paper Structure (27 sections, 15 equations, 4 figures, 4 tables)

This paper contains 27 sections, 15 equations, 4 figures, 4 tables.

Figures (4)

  • Figure 1: Architecture overview of the CoLLM framework. The system accepts user-defined analysis specifications and LHCO data files as inputs, generates analysis code for the selection cuts, performs validation and automatic error correction, and prepares the input to the deep learning models.
  • Figure 2: Screenshot of the CoLLM graphical user interface.
  • Figure 3: Validation histograms produced by CoLLM for the semi-leptonic $t\bar{t}$ analysis generated by CoLLM . The distributions show (top row) jet multiplicity, $H_{\mathrm{T}}$ and missing transverse energy; (middle row) transverse mass, $W$ hadronic candidate mass, and top hadronic candidate mass; (bottom row) leading and subleading $b$-jet $p_{\mathrm{T}}$, leading $b$-jet $\eta$, and $\Delta R$ between the lepton and closest $b$-jet. All histograms are normalized to unit area.
  • Figure 4: Validation histograms generated by CoLLM for Example 1.