Gaining (Mutual) Information about Quark/Gluon Discrimination
Andrew J. Larkoski, Jesse Thaler, Wouter J. Waalewijn
TL;DR
This work addresses the challenge of quark/gluon discrimination by reframing jet tagging with mutual information, separating true discrimination power from mere correlations. It introduces the generalized angularities ${\lambda^{\kappa}_{\beta}}$ to explore angular and energy-weighted radiation patterns across IRC-safe and IRC-unsafe regimes, and provides both parton-shower and analytic resummation insights. A key finding is that Casimir scaling at LL yields a universal truth overlap dependent on $C_A/C_F$, while NLL and nonperturbative effects (captured by weighted-energy functions) reveal meaningful differences and complementarities when combining observables. The study offers concrete predictions and methodological guidance for optimizing quark/gluon tagging and suggests targeted experimental measurements to refine nonperturbative inputs.These results advance understanding of when adding observables improves discrimination, quantify the extent of information gain from pairing angularities, and connect perturbative predictions with nonperturbative inputs, providing a framework for robust quark/gluon tagging in jet substructure.
Abstract
Discriminating quark jets from gluon jets is an important but challenging problem in jet substructure. In this paper, we use the concept of mutual information to illuminate the physics of quark/gluon tagging. Ideal quark/gluon separation requires only one bit of truth information, so even if two discriminant variables are largely uncorrelated, they can still share the same "truth overlap". Mutual information can be used to diagnose such situations, and thus determine which discriminant variables are redundant and which can be combined to improve performance. Using both parton showers and analytic resummation, we study a two-parameter family of generalized angularities, which includes familiar infrared and collinear (IRC) safe observables like thrust and broadening, as well as IRC unsafe variants like $p_T^D$ and hadron multiplicity. At leading-logarithmic (LL) order, the bulk of these variables exhibit Casimir scaling, such that their truth overlap is a universal function of the color factor ratio $C_A/C_F$. Only at next-to-leading-logarithmic (NLL) order can one see a difference in quark/gluon performance. For the IRC safe angularities, we show that the quark/gluon performance can be improved by combining angularities with complementary angular exponents. Interestingly, LL order, NLL order, Pythia 8, and Herwig++ all exhibit similar correlations between observables, but there are significant differences in the predicted quark/gluon discrimination power. For the IRC unsafe angularities, we show that the mutual information can be calculated analytically with the help of a nonperturbative "weighted-energy function", providing evidence for the complementarity of safe and unsafe observables for quark/gluon discrimination.
