Huge Ensembles Part II: Properties of a Huge Ensemble of Hindcasts Generated with Spherical Fourier Neural Operators
Ankur Mahesh, William Collins, Boris Bonev, Noah Brenowitz, Yair Cohen, Peter Harrington, Karthik Kashinath, Thorsten Kurth, Joshua North, Travis OBrien, Michael Pritchard, David Pruitt, Mark Risser, Shashank Subramanian, Jared Willard
TL;DR
The paper advances extreme-weather statistics by creating Huge Ensembles (HENS) of hindcasts using Spherical Fourier Neural Operators with bred vectors and multiple checkpoints to achieve a large, scalable sample of possible atmospheric trajectories. It demonstrates tail sampling and improved probabilistic forecasts, using metrics such as information gain and owCRPS, while comparing against smaller ensembles and traditional forecasts. The study also tackles practical challenges of generating, storing, and reproducing petabyte-scale ML ensembles, and discusses how such ensembles can complement traditional NWP for studying LLHIs and improving uncertainty quantification. Overall, HENS offers a scalable, data-driven tool to study extreme events and counterfactual weather scenarios, with open resources to support reproducibility and further research.
Abstract
In Part I, we created an ensemble based on Spherical Fourier Neural Operators. As initial condition perturbations, we used bred vectors, and as model perturbations, we used multiple checkpoints trained independently from scratch. Based on diagnostics that assess the ensemble's physical fidelity, our ensemble has comparable performance to operational weather forecasting systems. However, it requires orders of magnitude fewer computational resources. Here in Part II, we generate a huge ensemble (HENS), with 7,424 members initialized each day of summer 2023. We enumerate the technical requirements for running huge ensembles at this scale. HENS precisely samples the tails of the forecast distribution and presents a detailed sampling of internal variability. HENS has two primary applications: (1) as a large dataset with which to study the statistics and drivers of extreme weather and (2) as a weather forecasting system. For extreme climate statistics, HENS samples events 4$σ$ away from the ensemble mean. At each grid cell, HENS increases the skill of the most accurate ensemble member and enhances coverage of possible future trajectories. As a weather forecasting model, HENS issues extreme weather forecasts with better uncertainty quantification. It also reduces the probability of outlier events, in which the verification value lies outside the ensemble forecast distribution.
