CirT: Global Subseasonal-to-Seasonal Forecasting with Geometry-inspired Transformer
Yang Liu, Zinan Zheng, Jiashun Cheng, Fugee Tsung, Deli Zhao, Yu Rong, Jia Li
TL;DR
Subseasonal-to-Seasonal forecasting remains challenging due to atmospheric chaos and geometric distortions when treating the globe as a planar image. CirT addresses this by partitioning the graticule into equidistant latitudinal circular patches and performing self-attention in the frequency domain via a Fourier transform, enabling global, periodic spatial coupling. Direct biweekly prediction on ERA5 demonstrates CirT outperforms state-of-the-art data-driven models and skillful numerical systems, with ablations validating the importance of circular patching and frequency-domain mixing. The approach offers a geometry-aware pathway for robust global S2S forecasting and points to extensions incorporating vertical coupling and slow-evolving Earth system components.
Abstract
Accurate Subseasonal-to-Seasonal (S2S) climate forecasting is pivotal for decision-making including agriculture planning and disaster preparedness but is known to be challenging due to its chaotic nature. Although recent data-driven models have shown promising results, their performance is limited by inadequate consideration of geometric inductive biases. Usually, they treat the spherical weather data as planar images, resulting in an inaccurate representation of locations and spatial relations. In this work, we propose the geometric-inspired Circular Transformer (CirT) to model the cyclic characteristic of the graticule, consisting of two key designs: (1) Decomposing the weather data by latitude into circular patches that serve as input tokens to the Transformer; (2) Leveraging Fourier transform in self-attention to capture the global information and model the spatial periodicity. Extensive experiments on the Earth Reanalysis 5 (ERA5) reanalysis dataset demonstrate our model yields a significant improvement over the advanced data-driven models, including PanguWeather and GraphCast, as well as skillful ECMWF systems. Additionally, we empirically show the effectiveness of our model designs and high-quality prediction over spatial and temporal dimensions.
