Table of Contents
Fetching ...

PhyGHT: Physics-Guided HyperGraph Transformer for Signal Purification at the HL-LHC

Mohammed Rakib, Luke Vaughan, Shivang Patel, Flera Rizatdinova, Alexander Khanov, Atriya Sen

TL;DR

By accurately reconstructing the top quark's invariant mass, PhyGHT demonstrates how machine learning innovation and interdisciplinary collaboration can directly advance scientific discovery at the frontiers of experimental physics and enhance the HL-LHC's discovery potential.

Abstract

The High-Luminosity Large Hadron Collider (HL-LHC) at CERN will produce unprecedented datasets capable of revealing fundamental properties of the universe. However, realizing its discovery potential faces a significant challenge: extracting small signal fractions from overwhelming backgrounds dominated by approximately 200 simultaneous pileup collisions. This extreme noise severely distorts the physical observables required for accurate reconstruction. To address this, we introduce the Physics-Guided Hypergraph Transformer (PhyGHT), a hybrid architecture that combines distance-aware local graph attention with global self-attention to mirror the physical topology of particle showers formed in proton-proton collisions. Crucially, we integrate a Pileup Suppression Gate (PSG), an interpretable, physics-constrained mechanism that explicitly learns to filter soft noise prior to hypergraph aggregation. To validate our approach, we release a novel simulated dataset of top-quark pair production to model extreme pileup conditions. PhyGHT outperforms state-of-the-art baselines from the ATLAS and CMS experiments in predicting the signal's energy and mass correction factors. By accurately reconstructing the top quark's invariant mass, we demonstrate how machine learning innovation and interdisciplinary collaboration can directly advance scientific discovery at the frontiers of experimental physics and enhance the HL-LHC's discovery potential. The dataset and code are available at https://github.com/rAIson-Lab/PhyGHT

PhyGHT: Physics-Guided HyperGraph Transformer for Signal Purification at the HL-LHC

TL;DR

By accurately reconstructing the top quark's invariant mass, PhyGHT demonstrates how machine learning innovation and interdisciplinary collaboration can directly advance scientific discovery at the frontiers of experimental physics and enhance the HL-LHC's discovery potential.

Abstract

The High-Luminosity Large Hadron Collider (HL-LHC) at CERN will produce unprecedented datasets capable of revealing fundamental properties of the universe. However, realizing its discovery potential faces a significant challenge: extracting small signal fractions from overwhelming backgrounds dominated by approximately 200 simultaneous pileup collisions. This extreme noise severely distorts the physical observables required for accurate reconstruction. To address this, we introduce the Physics-Guided Hypergraph Transformer (PhyGHT), a hybrid architecture that combines distance-aware local graph attention with global self-attention to mirror the physical topology of particle showers formed in proton-proton collisions. Crucially, we integrate a Pileup Suppression Gate (PSG), an interpretable, physics-constrained mechanism that explicitly learns to filter soft noise prior to hypergraph aggregation. To validate our approach, we release a novel simulated dataset of top-quark pair production to model extreme pileup conditions. PhyGHT outperforms state-of-the-art baselines from the ATLAS and CMS experiments in predicting the signal's energy and mass correction factors. By accurately reconstructing the top quark's invariant mass, we demonstrate how machine learning innovation and interdisciplinary collaboration can directly advance scientific discovery at the frontiers of experimental physics and enhance the HL-LHC's discovery potential. The dataset and code are available at https://github.com/rAIson-Lab/PhyGHT
Paper Structure (63 sections, 24 equations, 7 figures, 6 tables)

This paper contains 63 sections, 24 equations, 7 figures, 6 tables.

Figures (7)

  • Figure 1: A Phoenix Event Displayphoenix depicting an event in the ATLAS inner tracking system in the HL-LHC conditions at $\langle\mu\rangle=60$ (left) and $\langle\mu\rangle=200$ (right) for signal (blue) and background (red) particles with $p_T>1.0$ GeV.
  • Figure 2: Overview of PhyGHT. It takes a heterogeneous graph of Tracks and Jets as input. It processes tracks through the Local Geometric (DA-GAT) block, followed by the Global Context block. The fused representations are filtered by the Pileup Suppression Gate (PSG) before being aggregated into the Jet representation via Hypergraph Attention for final regression.
  • Figure 3: 1D distribution and 2D correlation plots for Energy ($\hat{y}_E$) and Mass ($\hat{y}_M$) at $\langle \mu \rangle=60$ (top row) and $\langle \mu \rangle=200$ (bottom row).
  • Figure 4: Comparison of Energy and Mass Resolutions of PhyGHT with baselines at $\langle \mu \rangle=60$ and $\langle \mu \rangle=200$.
  • Figure 5: Mass Reconstruction after PhyGHT correction
  • ...and 2 more figures