Table of Contents
Fetching ...

JetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physics

Zeyu Xia, Tyler Kim, Trevor Reed, Judy Fox, Geoffrey Fox, Adam Szczepaniak

Abstract

High-fidelity Monte Carlo simulations and complex inverse problems, such as mapping smeared experimental observations to ground-truth states, are computationally intensive yet essential for robust data analysis. Conditional Flow Matching (CFM) offers a mathematically robust approach to accelerating these tasks, but we demonstrate its standard training loss is fundamentally misleading. In rigorous physics applications, CFM loss plateaus prematurely, serving as an unreliable indicator of true convergence and physical fidelity. To investigate this disconnect, we designed JetPrism, a configurable CFM framework acting as an efficient generative surrogate for evaluating unconditional generation and conditional detector unfolding. Using synthetic stress tests and a Jefferson Lab kinematic dataset ($γp \to ρ^0 p \to π^+π^- p$) relevant to the forthcoming Electron-Ion Collider (EIC), we establish that physics-informed metrics continue to improve significantly long after the standard loss converges. Consequently, we propose a multi-metric evaluation protocol incorporating marginal and pairwise $χ^2$ statistics, $W_1$ distances, correlation matrix distances ($D_{\mathrm{corr}}$), and nearest-neighbor distance ratios ($R_{\mathrm{NN}}$). By demonstrating that domain-specific evaluations must supersede generic loss metrics, this work establishes JetPrism as a dependable generative surrogate that ensures precise statistical agreement with ground-truth data without memorizing the training set. While demonstrated in nuclear physics, this diagnostic framework is readily extensible to parameter generation and complex inverse problems across broad domains. Potential applications span medical imaging, astrophysics, semiconductor discovery, and quantitative finance, where high-fidelity simulation, rigorous inversion, and generative reliability are critical.

JetPrism: diagnosing convergence for generative simulation and inverse problems in nuclear physics

Abstract

High-fidelity Monte Carlo simulations and complex inverse problems, such as mapping smeared experimental observations to ground-truth states, are computationally intensive yet essential for robust data analysis. Conditional Flow Matching (CFM) offers a mathematically robust approach to accelerating these tasks, but we demonstrate its standard training loss is fundamentally misleading. In rigorous physics applications, CFM loss plateaus prematurely, serving as an unreliable indicator of true convergence and physical fidelity. To investigate this disconnect, we designed JetPrism, a configurable CFM framework acting as an efficient generative surrogate for evaluating unconditional generation and conditional detector unfolding. Using synthetic stress tests and a Jefferson Lab kinematic dataset () relevant to the forthcoming Electron-Ion Collider (EIC), we establish that physics-informed metrics continue to improve significantly long after the standard loss converges. Consequently, we propose a multi-metric evaluation protocol incorporating marginal and pairwise statistics, distances, correlation matrix distances (), and nearest-neighbor distance ratios (). By demonstrating that domain-specific evaluations must supersede generic loss metrics, this work establishes JetPrism as a dependable generative surrogate that ensures precise statistical agreement with ground-truth data without memorizing the training set. While demonstrated in nuclear physics, this diagnostic framework is readily extensible to parameter generation and complex inverse problems across broad domains. Potential applications span medical imaging, astrophysics, semiconductor discovery, and quantitative finance, where high-fidelity simulation, rigorous inversion, and generative reliability are critical.

Paper Structure

This paper contains 29 sections, 2 equations, 17 figures, 4 tables.

Figures (17)

  • Figure 1: An illustration of the CFM generation process. The model learns to transform a simple base Gaussian distribution (left) to a complex three-peak mixed-Gaussian target distribution (right) by learning the intermediate velocity field (middle).
  • Figure 2: Training metrics tracking over time during the MC-POM generation task, comparing the CFM loss $\mathcal{L}_\mathrm{CFM}$ against physics-informed indicators ($W_1$, NFE, and $D_{\mathrm{corr}}$).
  • Figure 3: Close-up comparison of generated and ground-truth distributions in two sharp cut-off regions of the $t$-channel.
  • Figure 4: Comparison of generated kinematic distributions produced by the CFM model against the ground truth on the JLab MC-POM dataset.
  • Figure 5: Unfolding results ($\sigma_{\mathrm{smear}}=1.0$) mapping detector-level distributions back to the original particle-level truth, compared with both the target ground-truth and the initial smeared detector-level distributions.
  • ...and 12 more figures