Global Tropical Cyclone Intensity Forecasting with Multi-modal Multi-scale Causal Autoregressive Model
Xinyu Wang, Kang Chen, Lei Liu, Tao Han, Bin Li, Lei Bai
TL;DR
This paper tackles the challenge of global tropical cyclone intensity forecasting by proposing MSCAR, a multimodal, multi-scale autoregressive model that injects causal temporal relationships via CC Attention and fuses satellite and ERA5 data through a Feature Pyramid Network. It introduces SETCD, the longest global TC dataset with 72 variables spanning 1980–2022, enabling robust learning across scales and environments. Experiments on SETCD demonstrate state-of-the-art short-term forecasting globally and regionally, with substantial error reductions and strong real-time robustness when ERA5 is replaced by analysis data. The work provides publicly available code and data, facilitating broader adoption and future TC research, and highlights the practical impact of integrating causal, multimodal information for disaster risk reduction.
Abstract
Accurate forecasting of Tropical cyclone (TC) intensity is crucial for formulating disaster risk reduction strategies. Current methods predominantly rely on limited spatiotemporal information from ERA5 data and neglect the causal relationships between these physical variables, failing to fully capture the spatial and temporal patterns required for intensity forecasting. To address this issue, we propose a Multi-modal multi-Scale Causal AutoRegressive model (MSCAR), which is the first model that combines causal relationships with large-scale multi-modal data for global TC intensity autoregressive forecasting. Furthermore, given the current absence of a TC dataset that offers a wide range of spatial variables, we present the Satellite and ERA5-based Tropical Cyclone Dataset (SETCD), which stands as the longest and most comprehensive global dataset related to TCs. Experiments on the dataset show that MSCAR outperforms the state-of-the-art methods, achieving maximum reductions in global and regional forecast errors of 9.52% and 6.74%, respectively. The code and dataset are publicly available at https://anonymous.4open.science/r/MSCAR.
