Higgs Production Classifier using Weak Supervision
Kai-Feng Chen, Yi-An Chen, Cheng-Wei Chiang, Feng-Yang Hsieh
TL;DR
The paper tackles distinguishing Higgs production mechanisms, specifically vector-b boson fusion (VBF) and gluon-gluon fusion (GGF), in hadron collider data without event-level truth labels. It implements the Classification Without Labels (CWoLa) weakly supervised framework using real diphoton data, comparing convolutional neural networks (CNN) and a Particle Transformer (ParT) on image and set representations, respectively. A physics-motivated azimuthal augmentation ($\phi$-shifting) is introduced to boost training statistics, and the authors test decay-channel transferability across $H \to \gamma\gamma$, $H \to ZZ\to 4\ell$, and $H \to Z\gamma\to 2\ell\gamma$, observing strong cross-channel generalization especially when decay products are not affected by strong QCD. The results show that hadronic activity provides most discriminative power, that augmentation improves performance at low data regimes, and that decay-agnostic classifiers can be effectively reused across channels, enabling data-efficient Higgs analyses at the LHC.
Abstract
A reliable determination of the Higgs production mechanism in hadron collider experiments is essential in the program of the measurements of the Higgs couplings. We employ weak supervision, CWoLa in particular, to train deep neural networks using real data of the diphoton events, in the hope of reducing biases resulting from Monte Carlo simulations. Models based on the convolutional neural network and the transformer are tested and compared. In particular, the classification performance gets slightly better when the photon information is removed from training on the low-luminosity region of $H \to γγ$. We explicitly show that the performance can be improved when the training dataset is enlarged by data augmentation using physics-motivated methods. We further demonstrate that the trained model can be successfully applied to the $H \to ZZ$ and $H \to Zγ$ events, showing that such classifiers are agnostic to Higgs decay modes provided they do not involve strong QCD corrections.
