Beatnik: A Novel Global Communication Mini-Application
Jason A. Stewart, Patrick G. Bridges
TL;DR
Beatnik introduces a lightweight open-source mini-application that stresses complex global MPI communication patterns by simulating $3$D Raleigh-Taylor instabilities via the $Z$-Model on GPU-accelerated platforms. Built with the Cabana/Kokkos performance-portability stack, it provides high-, medium-, and low-order solvers that exercise FFT-based and far-field data redistributions to reveal diverse communication patterns. The paper details the architecture, implementation, and four benchmark setups, and demonstrates scalability up to $1024$ GPUs on the Lassen system, including configurations of the HeFFTe FFT library. Together, these contributions show Beatnik’s utility for exposing MPI bandwidth/latency bottlenecks, load imbalance, and solver-library interactions, guiding performance optimizations for complex HPC workloads.
Abstract
Beatnik is a novel open source mini-application that exercises the complex communication patterns often found in production codes but rarely found in benchmarks or mini-applications. It simulates 3D Raleigh-Taylor instabilities based on Pandya and Shkoller's Z-Model formulation using the Cabana performance portability framework. This paper presents both the high-level design and important implementation details about Beatnik along with four benchmark setups for evaluating different aspects of HPC communication system performance. Evaluation results demonstrate both Beatnik's scalability on modern accelerator-based systems using weak and strong scaling tests up to 1024 GPUs, along with Beatnik's ability to expose communication challenges in modern systems and solver libraries.
