Statistical post-processing yields accurate probabilistic forecasts from Artificial Intelligence weather models
Belinda Trotta, Robert Johnson, Catherine de Burgh-Day, Debra Hudson, Esteban Abellan, James Canvin, Andrew Kelly, Daniel Mentiplay, Benjamin Owen, Jennifer Whelan
TL;DR
The paper tackles biases and reliability issues in AI-based weather forecasts by applying the Bureau of Meteorology's IMPROVER statistical post-processing to ECMWF's AI forecasting system (AIFS) and comparing results to traditional NWP post-processing for HRES and ENS. It demonstrates that IMPROVER can significantly improve both deterministic and probabilistic outputs for AIFS without modifying existing workflows, and shows that blending AI forecasts with NWP forecasts yields further gains in forecast skill. Key findings include comparable calibration and CRPS improvements for AIFS relative to NWP, and consistent benefits from including AIFS in blended forecasts. The study provides a practical pathway for national meteorological centers to integrate AI-based forecasts into current operational systems in a low-risk, incremental manner, enhancing overall forecasting robustness and utility.
Abstract
Artificial Intelligence (AI) weather models are now reaching operational-grade performance for some variables, but like traditional Numerical Weather Prediction (NWP) models, they exhibit systematic biases and reliability issues. We test the application of the Bureau of Meteorology's existing statistical post-processing system, IMPROVER, to ECMWF's deterministic Artificial Intelligence Forecasting System (AIFS), and compare results against post-processed outputs from the ECMWF HRES and ENS models. Without any modification to processing workflows, post-processing yields comparable accuracy improvements for AIFS as for traditional NWP forecasts, in both expected value and probabilistic outputs. We show that blending AIFS with NWP models improves overall forecast skill, even when AIFS alone is not the most accurate component. These findings show that statistical post-processing methods developed for NWP are directly applicable to AI models, enabling national meteorological centres to incorporate AI forecasts into existing workflows in a low-risk, incremental fashion.
