ACE Metric: Advection and Convection Evaluation for Accurate Weather Forecasting
Doyi Kim, Minseok Seo, Yeji Choi
TL;DR
The paper addresses the inadequacy of conventional metrics for evaluating data-driven weather forecasts, which often produce blurred outputs that nonetheless score low RMSE. It introduces the Advection and Convection Error (ACE) metric, which decomposes forecast errors into advection (horizontal transport) and convection (vertical development) components via velocity-field estimation and a remapping step, with ACE = AE + CE/AE. Validation on WeatherBench2 and MovingMNIST shows that ACE captures critical atmospheric dynamics: methods that explicitly model advection and convection score better on ACE, while blurring the output does not improve it. The work provides a principled framework for assessing and guiding improvements in regional weather forecasting, potentially improving reliability for hazardous weather preparedness.
Abstract
Recently, data-driven weather forecasting methods have received significant attention for surpassing the RMSE performance of traditional numerical weather prediction (NWP) methods. However, data-driven models are trained to minimize the loss between forecasts and ground truth, often a pixel-wise loss. This can lead to models that produce blurred outputs which, despite differing significantly in detail from the actual weather conditions, still achieve low RMSE values. Although evaluation metrics from the computer vision field, such as PSNR, SSIM, and FVD, can be used, they are not entirely suitable for weather variables, because weather variables change continuously over time and lack the distinct object boundaries typically seen in computer vision images. To resolve these issues, we propose the Advection and Convection Error (ACE) metric, specifically designed to assess how well models predict advection and convection, two key atmospheric transport processes. We have validated the ACE metric on the WeatherBench2 and MovingMNIST datasets.
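The claim that blurred outputs can still achieve low RMSE can be illustrated with a toy example. The sketch below is not taken from the paper: the synthetic 32x32 field, the helper names `rmse` and `box_blur`, and the choice of a 9x9 mean filter are all assumptions made purely for illustration. It compares a sharp forecast whose "storm cell" is displaced by a few pixels against a heavily blurred forecast centered on the right location.

```python
import numpy as np

def rmse(a, b):
    """Pixel-wise root-mean-square error between two fields."""
    return float(np.sqrt(np.mean((a - b) ** 2)))

def box_blur(field, k):
    """Mean filter with a k x k window (k odd), zero-padded at the edges.
    Hypothetical helper used only for this illustration."""
    pad = k // 2
    padded = np.pad(field, pad, mode="constant")
    out = np.zeros_like(field, dtype=float)
    h, w = field.shape
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + h, dx:dx + w]
    return out / (k * k)

# Synthetic ground truth: one sharp 4x4 cell of intensity 1.0
truth = np.zeros((32, 32))
truth[14:18, 14:18] = 1.0

# Forecast A: correct shape and intensity, displaced 4 pixels to the right
sharp_shifted = np.roll(truth, shift=4, axis=1)

# Forecast B: correct location, but smeared out by a 9x9 mean filter
blurred = box_blur(truth, k=9)

rmse_sharp = rmse(sharp_shifted, truth)    # roughly 0.177
rmse_blurred = rmse(blurred, truth)        # roughly 0.108

print(f"sharp but displaced: RMSE={rmse_sharp:.3f}, peak={sharp_shifted.max():.3f}")
print(f"blurred in place:    RMSE={rmse_blurred:.3f}, peak={blurred.max():.3f}")
```

In this toy setting the blurred forecast loses about 80% of the peak intensity yet scores the lower RMSE, because the displaced sharp forecast is penalized twice: once where the cell is missing and once where it is wrongly placed. This is the kind of behavior that motivates a metric separating transport error from intensity error.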
