Nyx-RT: Adaptive Ray Tracing in the Nyx Hydrodynamical Code
Nathan X. Marshak, Kathlynn Simotas, Zarija Lukić, Hyunbae Park, James Ahrens, Chris R. Johnson
TL;DR
This work addresses the challenge of accurately modeling inhomogeneous reionization within cosmological simulations by integrating a GPU-accelerated, adaptive ray tracing radiative transfer (RT) scheme into the Nyx hydrodynamics code via AMReX abstractions. The method combines adaptive ray tracing with a novel forward source-merging algorithm and a photon-conserving geometric overlap correction, enabling self-consistent radiation-hydro evolution at exascale scales. Key contributions include a portable, filter-based RT implementation, robust handling of low-density neighbor cells, and production-scale demonstrations (up to 4096 GPUs on a $4096^3$ grid) that show convergence in reionization history and Ly-$\alpha$ forest flux. The significance lies in delivering physically faithful RT within large cosmological volumes at high resolution, facilitating systematic studies of reionization with realistic source populations and feedback in a scalable, architecture-agnostic framework.
Abstract
Numerical methods for radiative transfer play a key role in modern-day astrophysics and cosmology, including study of the inhomogeneous reionization process. In this context, ray tracing methods are well-regarded for accuracy but notorious for high computational cost. In this work, we extend the capabilities of the Nyx N-body / hydrodynamics code, coupling radiation to gravitational and gas dynamics. We formulate adaptive ray tracing as a novel series of filters and transformations that can be used with AMReX particle abstractions, simplifying implementation and enabling portability across Exascale GPU architectures. To address computational cost, we present a new algorithm for merging sources, which significantly accelerates computation once reionization is well underway. Furthermore, we develop a novel prescription for geometric overlap correction with low-density neighbor cells. We perform verification and validation against standard analytic and numerical test problems. Finally, we demonstrate scaling to up to 1024 nodes and 4096 GPUs running multiphysics cosmological simulations, with 4096^3 Eulerian gas cells, 4096^3 dark matter particles, and ray tracing on a 1024^3 coarse grid. For these full cosmological simulations, we demonstrate convergence in terms of reionization history and post-ionization Lyman-alpha forest flux.
