Computing the Full Earth System at 1 km Resolution
Daniel Klocke, Claudia Frauen, Jan Frederik Engels, Dmitry Alexeev, René Redler, Reiner Schnur, Helmuth Haak, Luis Kornblueh, Nils Brüggemann, Fatemeh Chegini, Manoel Römmer, Lars Hoffmann, Sabine Griessbach, Mathis Bode, Jonathan Coles, Miguel Gila, William Sawyer, Alexandru Calotoiu, Yakup Budanaz, Pratyai Mazumder, Marcin Copik, Benjamin Weber, Andreas Herten, Hendryk Bockelmann, Torsten Hoefler, Cathy Hohenegger, Bjorn Stevens
TL;DR
This work demonstrates for the first time a global, fully coupled Earth system simulation at $1.25$-km resolution with substantial temporal compression, achieved by leveraging heterogeneous GH200-based GPUs and CPUs. The authors map atmosphere/land to GPUs while placing ocean/biogeochemistry on CPUs, coordinated via a coupling interface, and they separate concerns by translating the Fortran dynamical core into a dataflow representation with DaCe, enabling architecture-specific optimizations without touching domain code. Key contributions include a final component mapping strategy, a DaCe-based Fortran-to-GPU workflow that yields large reductions in index lookups and memory bandwidth improvements, and strong/weak scaling results that reach up to $\\tau=145.7$ simulated days per day on high-end systems. The result is a scalable path to year-to-decade, full-Earth-system studies at km-scale, with significant implications for understanding climate feedbacks and informing adaptation under future warming scenarios, while illustrating a productive collaboration model between domain scientists and computer scientists for exascale-ready climate computing.
Abstract
We present the first-ever global simulation of the full Earth system at 1.25 km grid spacing, achieving highest time compression with an unseen number of degrees of freedom. Our model captures the flow of energy, water, and carbon through key components of the Earth system: atmosphere, ocean, and land. To achieve this landmark simulation, we harness the power of 8192 GPUs on Alps and 20480 GPUs on JUPITER, two of the world's largest GH200 superchip installations. We use both the Grace CPUs and Hopper GPUs by carefully balancing Earth's components in a heterogeneous setup and optimizing acceleration techniques available in ICON's codebase. We show how separation of concerns can reduce the code complexity by half while increasing performance and portability. Our achieved time compression of 145.7 simulated days per day enables long studies including full interactions in the Earth system and even outperforms earlier atmosphere-only simulations at a similar resolution.
