PMT Waveform Simulation and Reconstruction with Conditional Diffusion Network
Kainan Liu, Jingyu Huang, Guihong Huang, Jianyi Luo
TL;DR
This work tackles PMT waveform reconstruction under multi-photon, high-overlap conditions by introducing a bidirectional conditional diffusion framework that jointly simulates waveforms from PE sequences and reconstructs PE sequences from waveforms in a weakly supervised setting. The approach comprises two diffusion modules, DFA and DFB, operating in a loop that iteratively refines waveform realism and PE inference, with ResNet50 provided as a performance benchmark. Supervised results establish strong baselines for both waveform simulation and nPE/timing reconstruction, while the weakly supervised regime demonstrates practical viability when ground-truth labels are unavailable, achieving about 99% of the supervised nPE resolution and around 0.5 ns timing precision under favorable data distributions. The method offers a data-driven alternative for detector-level waveform modeling, reducing reliance on exact PE labels and enabling broader applicability to vertex and energy reconstruction in neutrino experiments.
Abstract
Photomultiplier tubes (PMTs) are widely employed in particle and nuclear physics experiments. The accuracy of PMT waveform reconstruction directly impacts the detector's spatial and energy resolution. A key challenge arises when multiple photons arrive within a few nanoseconds, making it difficult to resolve individual photoelectrons (PEs). Although supervised deep learning methods have surpassed traditional methods in performance, their practical applicability is limited by the lack of ground-truth PE labels in real data. To address this issue, we propose an innovative weakly supervised waveform simulation and reconstruction approach based on a bidirectional conditional diffusion network framework. The method is fully data-driven and requires only raw waveforms and coarse estimates of PE information as input. It first employs a PE-conditioned diffusion model to simulate realistic waveforms from PE sequences, thereby learning the features of overlapping waveforms. Subsequently, these simulated waveforms are used to train a waveform-conditioned diffusion model to reconstruct the PE sequences from waveforms, reinforcing the learning of features of overlapping waveforms. Through iterative refinement between the two conditional diffusion processes, the model progressively improves reconstruction accuracy. Experimental results demonstrate that the proposed method achieves 99% of the normalized PE-number resolution averaged over 1-5 p.e. and 80% of the timing resolution attained by fully supervised learning.
