Generative Artificial Intelligence for Air Shower Simulation
C. Bozza, A. Calivà, A. De Caro, D. De Gruttola, S. De Pasquale, L. A. Fusco, G. Messuti, C. Poirè, S. Scarpetta, T. Virgili
TL;DR
This work tackles the computational bottleneck of detailed extensive air shower simulations by introducing GAIAS2, a generative surrogate based on a Wasserstein GAN with Gradient Penalty and Self-Attention trained on CORSIKA proton-induced showers. A fixed 4D tensor representation and data-driven binning enable efficient learning of ground-level muon distributions, while an ensemble of 57 generators mitigates mode-coverage issues and achieves a Wasserstein distance of about 0.04 relative to training data. The trained model produces tens of thousands of samples in under a minute on a single GPU, offering roughly a $10^4$x speed-up over full MC simulations and a pathway toward real-time conditioning and broader applicability in astroparticle experiments. This approach is detector-agnostic and can be extended to multiple particle species and integrated with differentiable or hybrid simulation pipelines for rapid, scalable cosmic-ray studies.
Abstract
The detailed simulation of extensive air showers, produced by primary cosmic rays interacting in the atmosphere, is a task that is traditionally undertaken by means of Monte Carlo methods. These processes are computationally intensive, accounting for a major fraction of the computational resources used in the large-scale simulations required by current and future experiments in the field of astroparticle physics. In this work, we present a novel approach based on Generative Adversarial Networks (GANs) to accelerate air shower simulations. We developed and trained a GAN on a dataset of high-energy proton-induced air showers generated with \texttt{CORSIKA}; our model reproduces key distributions of secondary particles, such as energy spectra and spatial distributions at ground level of muons. Once the model has been trained, which takes approximately 74 hours, the generation real time per shower is reduced by a factor of $10^4$ with respect to the full \texttt{CORSIKA} simulation, leading to a substantial decrease in both computational time and energy consumption.
