AIFS-CRPS: Ensemble forecasting using a model trained with a loss function based on the Continuous Ranked Probability Score
Simon Lang, Mihai Alexe, Mariana C. A. Clare, Christopher Roberts, Rilwan Adewoyin, Zied Ben Bouallègue, Matthew Chantry, Jesper Dramsch, Peter D. Dueben, Sara Hahner, Pedro Maciel, Ana Prieto-Nemesio, Cathal O'Brien, Florian Pinault, Jan Polster, Baudouin Raoult, Steffen Tietsche, Martin Leutbecher
TL;DR
ECMWF introduces AIFS-CRPS, a probabilistic machine-learned ensemble forecast model trained with the almost fair CRPS (afCRPS) loss to generate exchangeable stochastic members. The approach combines a transformer-based encoder–processor–decoder with a four-stage training regime and rollout mitigation via reference-field downsampling, delivering realistic variability and improved skill for medium-range and subseasonal forecasts. Across O96 and N320 grid configurations, AIFS-CRPS generally outperforms the 9 km IFS ensemble for many upper-air and several surface variables, while maintaining cheap inference and scalable ensemble generation. Remaining challenges include stratospheric performance tied to loss scaling and initial-condition perturbations, with planned work to address these and extend observation-based training. The results suggest a promising path to real-time, probabilistic forecasts that leverage machine learning for calibrated uncertainty estimates.
Abstract
Over the last three decades, ensemble forecasts have become an integral part of forecasting the weather. They provide users with more complete information than single forecasts as they permit to estimate the probability of weather events by representing the sources of uncertainties and accounting for the day-to-day variability of error growth in the atmosphere. This paper presents a novel approach to obtain a weather forecast model for ensemble forecasting with machine-learning. AIFS-CRPS is a variant of the Artificial Intelligence Forecasting System (AIFS) developed at ECMWF. Its loss function is based on a proper score, the Continuous Ranked Probability Score (CRPS). For the loss, the almost fair CRPS is introduced because it approximately removes the bias in the score due to finite ensemble size yet avoids a degeneracy of the fair CRPS. The trained model is stochastic and can generate as many exchangeable members as desired and computationally feasible in inference. For medium-range forecasts AIFS-CRPS outperforms the physics-based Integrated Forecasting System (IFS) ensemble for the majority of variables and lead times. For subseasonal forecasts, AIFS-CRPS outperforms the IFS ensemble before calibration and is competitive with the IFS ensemble when forecasts are evaluated as anomalies to remove the influence of model biases.
