MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model
Haoyu Qin, Yungang Chen, Qianchuan Jiang, Pengchao Sun, Xiancai Ye, Chao Lin
TL;DR
This work addresses accurate regional weather prediction with deep learning by introducing MetMamba, a limited-area DLWP backbone based on the Mamba state-space model. MetMamba processes spatial-temporal data natively and is evaluated alongside Swin- and AFNO-based backbones, with a training regime that couples the local model to a global host via lateral boundary conditions. The results show MetMamba achieves superior or comparable performance to global baselines across most variables, demonstrates reduced artifacts at long lead times, and confirms the viability of DLWP-LAM with global-host coupling. The study underscores the potential of state-space backbones for high-resolution regional forecasting and outlines paths to further improvements through larger datasets and better host models.
Abstract
Deep Learning based Weather Prediction (DLWP) models have been improving rapidly over the last few years, surpassing state of the art numerical weather forecasts by significant margins. While much of the optimization effort is focused on training curriculum to extend forecast range in the global context, two aspects remains less explored: limited area modeling and better backbones for weather forecasting. We show in this paper that MetMamba, a DLWP model built on a state-of-the-art state-space model, Mamba, offers notable performance gains and unique advantages over other popular backbones using traditional attention mechanisms and neural operators. We also demonstrate the feasibility of deep learning based limited area modeling via coupled training with a global host model.
