A Weather Foundation Model for the Power Grid
Cristian Bodnar, Raphaël Rousseau-Rizzi, Nikhil Shankar, James Merleau, Stylianos Flampouris, Guillem Candille, Slavica Antic, François Miralles, Jayesh K. Gupta
TL;DR
This work demonstrates that Weather Foundation Models (WFMs) can be effectively adapted to power-grid operations by post-training a pretrained Generative Forecasting Transformer (GFT) on utility asset data. The resulting GFT-HQ delivers hyper-local forecasts for temperature, precipitation, hub-height wind, wind-farm icing, and rime-ice risk on transmission lines, outperforming strong NWP baselines across multiple lead times and metrics, including an unprecedented average precision of $AP\approx 0.72$ for day-ahead rime-ice detection. The approach achieves significant reductions in traditional forecast errors (e.g., $MAE$ for temperature by ~15%, precipitation by ~35%, wind by ~15%) and enables new operational capabilities such as proactive de-icing and improved dynamic line ratings. Practically, this work shows that minimal post-training data can unlock asset-specific, multivariate forecasts with cross-variable coherence, providing actionable insights for grid resilience and reliability under changing climate and asset portfolios.
Abstract
Weather foundation models (WFMs) have recently set new benchmarks in global forecast skill, yet their concrete value for the weather-sensitive infrastructure that powers modern society remains largely unexplored. In this study, we fine-tune Silurian AI's 1.5B-parameter WFM, Generative Forecasting Transformer (GFT), on a rich archive of Hydro-Québec asset observations--including transmission-line weather stations, wind-farm met-mast streams, and icing sensors--to deliver hyper-local, asset-level forecasts for five grid-critical variables: surface temperature, precipitation, hub-height wind speed, wind-turbine icing risk, and rime-ice accretion on overhead conductors. Across 6-72 h lead times, the tailored model surpasses state-of-the-art NWP benchmarks, trimming temperature mean absolute error (MAE) by 15%, total-precipitation MAE by 35%, and lowering wind speed MAE by 15%. Most importantly, it attains an average precision score of 0.72 for day-ahead rime-ice detection, a capability absent from existing operational systems, which affords several hours of actionable warning for potentially catastrophic outage events. These results show that WFMs, when post-trained with small amounts of high-fidelity, can serve as a practical foundation for next-generation grid-resilience intelligence.
