Improving the performance of weak supervision searches using transfer and meta-learning
Hugues Beauchesne, Zong-En Chen, Cheng-Wei Chiang
TL;DR
Weak supervision searches in collider physics are limited by the need for sizable signal to train neural networks. The authors address this bottleneck by combining transfer learning, via pretraining on simulations, with meta-learning to create fast-learning networks that adapt with less experimental data. Using dark-shower signals generated with the Pythia Hidden Valley module and a CNN-based CWoLa framework, they show that transfer learning substantially lowers the signal required for discovery (often by a factor of a few to reach $5\sigma$) and that meta-transfer learning provides additional gains. This work offers a practical proof-of-principle for making weak supervision more robust to limited signal and points to future work on model choices and systematic uncertainties.
Abstract
Weak supervision searches have in principle the advantages of both being able to train on experimental data and being able to learn distinctive signal properties. However, the practical applicability of such searches is limited by the fact that successfully training a neural network via weak supervision can require a large amount of signal. In this work, we seek to create neural networks that can learn from less experimental signal by using transfer and meta-learning. The general idea is to first train a neural network on simulations, thereby learning concepts that can be reused or becoming a more efficient learner. The neural network would then be trained on experimental data and should require less signal because of its previous training. We find that transfer and meta-learning can substantially improve the performance of weak supervision searches.
