Boosted decision tree reweighting of simulated neutrino interactions for $O(1)$ GeV neutrino cross-section measurements
Z. Lin, S. Akhter, Z. Ahmad Dar, N. S. Alex, M. Betancourt, S. Boyd, H. Budd, G. Caceres, G. A. Díaz, J. Felix, L. Fields, A. M. Gago, P. K. Gaur, S. M. Gilligan, R. Gran, D. A. Harris, A. L. Hart, J. Kleykamp, A. Klustová, D. Last, A. Lozano, X. -G. Lu, S. Manly, W. A. Mann, K. S. McFarland, O. Moreno, J. K. Nelson, V. Paolone, G. N. Perdue, C. Pernas, M. A. Ramírez, N. Roy, D. Ruterbories, H. Schellman, C. J. Solano Salinas, D. S. Correia, M. Sultana, N. H. Vaughan, A. V. Waldron, B. Yaeggy, L. Zazueta
TL;DR
This work presents a generic multidimensional reweighting framework for $O(1)$ GeV neutrino MC using a Boosted Decision Tree Reweighter to map a source generator’s final-state distributions onto a target model’s expectations, thereby enabling efficient reuse of legacy simulations. By organizing events into controllable topology-based categories and training per-category reweighters on detector-relevant observables, the method achieves improved agreement with the target across high-dimensional spaces and derived quantities such as TKIs and efficiencies. The approach is validated by reweighting GENIE v2.12.6 to GENIE v3.04 AR23 for MINERvA’s CCQE-like $ u_$-carbon sample, demonstrating reduced KS distances and more accurate efficiency predictions, with clear guidance for generalizing to other channels and generators. This technique offers practical benefits by avoiding full MC regeneration and facilitating systematic studies using legacy data while underscoring the need to propagate target-model uncertainties through reweighted predictions.
Abstract
This paper illustrates a generic method for multi-dimensional reweighting of $O(1)$ GeV neutrino interaction Monte Carlo samples. The reweighting is based on a Boosted Decision Tree algorithm trained on high-dimensional space in detector final state observables. This enables one generator's events to be reweighted so that its reconstructed particle content and kinematics distributions, as well as detector efficiency, match those of a target model. The approach establishes an efficient way to reuse legacy Monte Carlo data, avoiding re-generation. As an example, we test its use in a measurement of transverse kinematic imbalance of the $μ^-$ and proton in charged-current quasielastic like $ν_μ$ events from the MINERvA experiment.
