A machine-learning approach to thunderstorm forecasting through post-processing of simulation data
Kianusch Vahid Yousefnia, Tobias Bölle, Isabella Zöbisch, Thomas Gerz
TL;DR
This work tackles the challenge of reliable thunderstorm forecasting by post-processing high-resolution ensemble forecasts with a neural-network named SALAMA. SALAMA ingests 21 physically related NWP predictors to output a pixelwise thunderstorm probability, calibrated to climatology and trained on simulated data tied to LINET lightning observations. It demonstrates superior skill to a reflectivity-based baseline across lead times up to $11\ \mathrm{h}$ and reveals how prediction skill scales with the spatiotemporal label resolution and the intrinsic NWP spread. The findings suggest operational applicability with real-time data and highlight the importance of scale and ensemble uncertainty in shaping forecast reliability and usefulness.
Abstract
Thunderstorms pose a major hazard to society and economy, which calls for reliable thunderstorm forecasts. In this work, we introduce a Signature-based Approach of identifying Lightning Activity using MAchine learning (SALAMA), a feedforward neural network model for identifying thunderstorm occurrence in numerical weather prediction (NWP) data. The model is trained on convection-resolving ensemble forecasts over Central Europe and lightning observations. Given only a set of pixel-wise input parameters that are extracted from NWP data and related to thunderstorm development, SALAMA infers the probability of thunderstorm occurrence in a reliably calibrated manner. For lead times up to eleven hours, we find a forecast skill superior to classification based only on NWP reflectivity. Varying the spatiotemporal criteria by which we associate lightning observations with NWP data, we show that the time scale for skillful thunderstorm predictions increases linearly with the spatial scale of the forecast.
