WSSM: Geographic-enhanced hierarchical state-space model for global station weather forecast
Songru Yang, Zili Liu, Zhenwei Shi, Zhengxia Zou
TL;DR
The paper tackles the challenge of global station weather forecasting with a focus on extreme-event prediction. It introduces Weather State-space Model (WSSM), a novel Mamba-based framework that embeds geographical context and multiscale time-frequency representations within a state-space paradigm. The authors propose three innovations: Geographical encoding to incorporate acquisition time and sensor location, a Hierarchical Bi-Mamba encoder to fuse multi-scale sequences, and a Time-frequency Bi-Mamba block to capture low- and high-frequency dynamics. Experiments on the Weather-5K dataset show that WSSM achieves state-of-the-art overall accuracy and substantially improves extreme-weather forecasting, outperforming Transformer-based methods in SEDI while remaining efficient through continuous-time modeling. This work paves the way for more accurate and scalable global station weather forecasting by integrating domain-specific spatio-temporal structure into a Mamba-based state-space model.
Abstract
Global Station Weather Forecasting (GSWF), a prominent meteorological research area, is pivotal in providing timely localized weather predictions. Despite the progress existing models have made in the overall accuracy of the GSWF, executing high-precision extreme event prediction still presents a substantial challenge. The recent emergence of state-space models, with their ability to efficiently capture continuous-time dynamics and latent states, offer potential solutions. However, early investigations indicated that Mamba underperforms in the context of GSWF, suggesting further adaptation and optimization. To tackle this problem, in this paper, we introduce Weather State-space Model (WSSM), a novel Mamba-based approach tailored for GSWF. Geographical knowledge is integrated in addition to the widely-used positional encoding to represent the absolute special-temporal position. The multi-scale time-frequency features are synthesized from coarse to fine to model the seasonal to extreme weather dynamic. Our method effectively improves the overall prediction accuracy and addresses the challenge of forecasting extreme weather events. The state-of-the-art results obtained on the Weather-5K subset underscore the efficacy of the WSSM
