Using Exascale Computing to Explain the Delicate Balance of Nuclear Forces in the Universe
M. A. Clark, A. Hanlon, D. Howarth, B. Joo, S. Krieg, D. McDougall, A. Meyer, H. Monge-Camacho, C. Morningstar, S. Park, F. Romero-López, P. M. Vranas, A. Walker-Loud
TL;DR
This work addresses how the delicate balance of nuclear forces emerges from QCD by targeting the deuteron binding energy and its cosmological implications for Big Bang Nucleosynthesis. It introduces a GPU-accelerated lattice QCD workflow built on LapH smearing (LapH) and the QUDA library, augmented with a batched Lanczos eigensolver and batched linear solves to enable two-nucleon calculations near physical quark masses on exascale hardware. The key contributions include the first GPU-enabled LapH implementation for two-nucleon studies, the introduction of quda_laph for end-to-end workflow management, and a scalable, memory-efficient eigenvector and projection strategy that yields up to approx 240× speedups over CPU baselines. The results demonstrate near-ideal weak scaling on multiple exascale-class systems and establish a practical pathway to connect QCD predictions with nuclear phenomenology, improving precision tests of the Standard Model and informing cosmological questions about the universe's hydrogen abundance.
Abstract
The vast majority of visible matter in our universe comes from protons and neutrons (the nucleons). Nucleon interactions are fundamental to how the universe developed after the Big Bang and govern all nuclear phenomena. The subtle balance in how two nucleons interact shapes the universe's hydrogen content that is central to our existence. Our objective is to compute the interaction strength while varying the parameters of nature to understand how delicate this balance is. We developed a new code using sophisticated physics algorithms and a highly optimized library for simulations on CPU-GPU parallel architectures. It has excellent weak scaling and impressive linear scaling for a fixed problem size with increasing number of nodes up to El Capitan's full $\sim$11,000 nodes. On Alps, El Capitan, Frontier, Jupiter, and Perlmutter supercomputers we achieve a maximum disruptive speed-up of $\sim$240 times the previous state-of-the-art, signaling a new era of supercomputing.
