Local wind speed forecasting at short time horizons based on Numerical Weather Prediction and observations from surrounding stations
Roberta Baggio, Killian Pujol, Florian Pantillon, Dominique Lambert, Jean-Baptiste Filippi, Jean-François Muzy
TL;DR
The paper addresses the challenge of accurate short-term wind speed forecasting in complex terrain by blending Numerical Weather Prediction outputs (ARPEGE and AROME) with nearby ground-station observations through a hybrid neural network. It introduces deterministic and probabilistic forecasting via a three-branch architecture and an M-Rice distribution to capture uncertainty and extreme events, achieving up to ~30% RMSE improvement over raw NWP baselines. The study demonstrates that the hybrid approach outperforms baselines across 278 stations and that probabilistic forecasts further enhance extreme-event prediction, with notable gains from fine-tuning at Corsican sites. Operational feasibility is highlighted through a low-latency inference pipeline, supporting real-time wind energy and safety applications, and the work points to future extensions including ensemble-NWP, hub-height extrapolation, and transformer-based spatiotemporal models.
Abstract
This study presents a hybrid neural network model for short-term (1-6 hours ahead) surface wind speed forecasting, combining Numerical Weather Prediction (NWP) with observational data from ground weather stations. It relies on the MeteoNet dataset, which includes data from global (ARPEGE) and regional (AROME) NWP models of the French weather service and meteorological observations from ground stations in the French Mediterranean. The proposed neural network architecture integrates recent past station observations (over last few hours) and AROME and ARPEGE predictions on a small subgrid around the target location. The model is designed to provide both deterministic and probabilistic forecasts, with the latter predicting the parameters of a suitable probability distribution that notably allows us to capture extreme wind events. Our results demonstrate that the hybrid model significantly outperforms baseline methods, including raw NWP predictions, persistence models, and linear regression, across all forecast horizons. For instance, the model reduces RMSE by up 30\% compared to AROME predictions. Probabilistic forecasting further enhances performance, particularly for extreme quantiles, by estimating conditional quantiles rather than relying solely on the conditional mean. Fine-tuning the model for specific stations, such as those in the Mediterranean island of Corsica, further improves forecasting accuracy. Our study highlights the importance of integrating multiple data sources and probabilistic approaches to improve short-term wind speed forecasting. It defines an effective approach, even in a complex terrain like Corsica where localized wind variations are significant
