Designing Observables for Measurements with Deep Learning
Owen Long, Benjamin Nachman
TL;DR
The paper addresses the challenge of extracting physics parameters from unfolding analyses where observables are manually chosen and distorted by detector effects. It introduces a neural-network observable trained with a two-term loss $L[f] = L_{\text{classic}}[f(z), \\mu] + \\lambda\\, L_{\text{new}}[f(x), f(z)]$ to maximize parameter sensitivity while suppressing detector distortions. The authors validate the approach on toy continuous-parameter estimation and a binary-discrimination task in deep inelastic scattering (H1/DIS), demonstrating improved discrimination power and reduced unfolding dependence compared with classical observables. The work offers a path toward more informative unfolded cross sections and potential applications in precise measurements and MC-tuning, with open-source code for replication.
Abstract
Many analyses in particle and nuclear physics use simulations to infer fundamental, effective, or phenomenological parameters of the underlying physics models. When the inference is performed with unfolded cross sections, the observables are designed using physics intuition and heuristics. We propose to design targeted observables with machine learning. Unfolded, differential cross sections in a neural network output contain the most information about parameters of interest and can be well-measured by construction. The networks are trained using a custom loss function that rewards outputs that are sensitive to the parameter(s) of interest while simultaneously penalizing outputs that are different between particle-level and detector-level (to minimize detector distortions). We demonstrate this idea in simulation using two physics models for inclusive measurements in deep inelastic scattering. We find that the new approach is more sensitive than classical observables at distinguishing the two models and also has a reduced unfolding uncertainty due to the reduced detector distortions.
