Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale
Jeremy J. Williams, Stefan Costea, Daniel Medeiros, Jordy Trilaksono, Pratibha Hegde, David Tskhakaya, Leon Kos, Ales Podolnik, Jakub Hromadka, Kevin A. Huck, Allen D. Malony, Frank Jenko, Erwin Laure, Stefano Markidis
TL;DR
This work tackles the I/O bottlenecks of large-scale PIC MC plasma simulations by integrating a hybrid MPI+OpenMP BIT1 with the openPMD standard and ADIOS2 SST for in-memory streaming and in-situ visualization. It implements OpenMP task-based parallelism for particle movement, enables streaming I/O via SST, and supports real-time checkpointing and visualization through openPMD, delivering measurable improvements in throughput, metadata handling, and data accessibility. Profiling across multiple HPC systems demonstrates reduced I/O overhead, scalable MPI communication, and real-time diagnostic capability, while identifying trade-offs in memory usage and read times. The approach offers a practical pathway to exascale-ready plasma simulations with faster insight and more efficient resource use, and it outlines future work to further enhance scalability and visualization integration.
Abstract
Efficient simulation of complex plasma dynamics is crucial for advancing fusion energy research. Particle-in-Cell (PIC) Monte Carlo (MC) simulations provide insights into plasma behavior, including turbulence and confinement, which are essential for optimizing fusion reactor performance. Transitioning to exascale simulations introduces significant challenges, with traditional file input/output (I/O) inefficiencies remaining a key bottleneck. This work advances BIT1, an electrostatic PIC MC code, by improving the particle mover with OpenMP task-based parallelism, integrating the openPMD streaming API, and enabling in-memory data streaming with ADIOS2's Sustainable Staging Transport (SST) engine to enhance I/O performance, computational efficiency, and system storage utilization. We employ profiling tools such as gprof, perf, IPM and Darshan, which provide insights into computation, communication, and I/O operations. We implement time-dependent data checkpointing with the openPMD API enabling seamless data movement and in-situ visualization for real-time analysis without interrupting the simulation. We demonstrate improvements in simulation runtime, data accessibility and real-time insights by comparing traditional file I/O with the ADIOS2 BP4 and SST backends. The proposed hybrid BIT1 openPMD SST enhancement introduces a new paradigm for real-time scientific discovery in plasma simulations, enabling faster insights and more efficient use of exascale computing resources.
