FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

Lei Chen; Xiaohui Zhong; Hao Li; Jie Wu; Bo Lu; Deliang Chen; Shangping Xie; Qingchen Chao; Chensen Lin; Zixin Hu; Yuan Qi

FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

Lei Chen, Xiaohui Zhong, Hao Li, Jie Wu, Bo Lu, Deliang Chen, Shangping Xie, Qingchen Chao, Chensen Lin, Zixin Hu, Yuan Qi

TL;DR

FuXi-S2S presents a probabilistic subseasonal forecasting model trained on long ERA5 history that outputs global daily means up to 42 days ahead across 16 atmospheric variables. By injecting flow-dependent perturbations into a latent space and distilling target distributions via KL loss, FuXi-S2S achieves higher ensemble-mean accuracy for TP and OLR and extends MJO skill to ~36 days, outperforming ECMWF S2S in multiple metrics. The approach enables large, fast ensembles and reveals interpretable precursor signals (e.g., SST patterns) linked to extreme events such as the 2022 Pakistan floods, suggesting a pathway toward discovery-driven Earth-system insights in subseasonal forecasting. The work highlights the potential of ML-based subseasonal forecasts to surpass traditional EPS in both accuracy and interpretability, while outlining avenues for higher resolution, expanded variables, and coupled data integration to further enhance practical impact.

Abstract

Skillful subseasonal forecasts are crucial for various sectors of society but pose a grand scientific challenge. Recently, machine learning based weather forecasting models outperform the most successful numerical weather predictions generated by the European Centre for Medium-Range Weather Forecasts (ECMWF), but have not yet surpassed conventional models at subseasonal timescales. This paper introduces FuXi Subseasonal-to-Seasonal (FuXi-S2S), a machine learning model that provides global daily mean forecasts up to 42 days, encompassing five upper-air atmospheric variables at 13 pressure levels and 11 surface variables. FuXi-S2S, trained on 72 years of daily statistics from ECMWF ERA5 reanalysis data, outperforms the ECMWF's state-of-the-art Subseasonal-to-Seasonal model in ensemble mean and ensemble forecasts for total precipitation and outgoing longwave radiation, notably enhancing global precipitation forecast. The improved performance of FuXi-S2S can be primarily attributed to its superior capability to capture forecast uncertainty and accurately predict the Madden-Julian Oscillation (MJO), extending the skillful MJO prediction from 30 days to 36 days. Moreover, FuXi-S2S not only captures realistic teleconnections associated with the MJO, but also emerges as a valuable tool for discovering precursor signals, offering researchers insights and potentially establishing a new paradigm in Earth system science research.

FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

TL;DR

Abstract

Paper Structure (14 sections, 13 equations, 5 figures, 1 table)

This paper contains 14 sections, 13 equations, 5 figures, 1 table.

Introduction
Results
Deterministic metrics
Probabilistic metrics
Extreme forecast
MJO forecast
Prediction of the 2022 Pakistan floods
Discovery of precursor signals for the 2022 Pakistan floods prediction
Discussion
Methods
Data
FuXi-S2S model
Saliency map
Evaluation method

Figures (5)

Figure 1: Comparison of globally-averaged and latitude-weighted temporal anomaly correlation coefficient ($\textrm{TCC}$) of the ensemble mean between ECMWF subseasonal-to-seasonal (S2S) reforecasts (in blue) and FuXi-S2S forecasts (in red) for total precipitation ($\textrm{TP}$), 2-meter temperature ($\textrm{T2M}$), geopotential at 500 hPa ($\textrm{Z500}$), and outgoing longwave radiation ($\textrm{OLR}$). Rows 1 and 2 represent the performance across these variables, utilizing all testing data from the period spanning from 2017 to 2021. A bootstrapping approach, repeated 1000 times, is used for significance testing. When the FuXi-S2S forecasts fail to show a statistically significant improvement over the ECMWF S2S reforecasts at the 97.5% confidence level, a pale color scheme is used to denote these results.
Figure 2: Maps displaying the average Ranked Probability skill Score ($\textrm{RPSS}$) (first and second rows) and Brier Skill Score ($\textrm{BSS}$) (third and fourth rows) without latitude weighting, comparing ECMWF subseasonal-to-seasonal (S2S) (first column) and FuXi-S2S (second column) forecasts. Additionally, the third column depicts the difference in $\textrm{RPSS}$ and $\textrm{BSS}$ between FuXi-S2S and ECMWF S2S for total precipitation ($\textrm{TP}$) at forecast lead times of weeks 3-4 (first and third rows) and weeks 5-6 (second and fourth rows), utilizing all testing data from 2017 to 2021. Red contour lines in the first and second columns indicate areas with positive values of $\textrm{RPSS}$ and $\textrm{BSS}$. Stippling on the map denotes areas where the skill score is statistically significant at the 97.5% confidence level. Specifically, in columns 1 and 2, stippling indicates regions where the skill scores of the ECMWF S2S and FuXi-S2S models significantly surpasses those of climatology. In column 3, stippling highlights areas where the FuXi-S2S model significantly outperforms the ECMWF S2S.
Figure 3: Comparison of real-time multivariate Madden–Julian Oscillation (MJO) (RMM) bivariate Correlation ($\textrm{COR}$) of the ensemble mean between ECMWF subseasonal-to-seasonal (S2S) reforecasts (in blue) and FuXi-S2S forecasts (in red) using all testing data from 2017 to 2021. $\textbf{a}$) Comparison of RMM bivariate $\textrm{COR}$ as a function of forecast lead times. Dashed black line signifies the prediction skill threshold of $\textrm{COR}$=0.5. $\textbf{b}$) The RMM bivariate $\textrm{COR}$ is depicted as a function of the month of initialization (x-axis) and forecast lead time (y-axis), with red and blue lines indicating the skillful MJO prediction days of ECMWF S2S (in blue) and FuXi-S2S (in red), respectively.
Figure 4: Comparative analysis for the 2022 Pakistan floods predictions between the ECMWF subseasonal-to-seasonal (S2S) and FuXi-S2S models as well as the precursor signals that contributed to accurate predictions by the FuXi-S2S model. Comparison of spatially and temporally averaged standardized total precipitation ($\textrm{TP}$) anomaly (a) over the two weeks from August 16th to August 31st, 2022, showcasing GPCP observations (in black) alongside predictions from ECMWF S2S real-time forecasts (in blue) and FuXi-S2S forecasts (in red), with initialization dates: August 11th (08-11, MM-DD), August 8th (08-08), August 4th (08-04), August 1st (08-01), July 28th (07-28), July 25th (07-25), and July 21st (07-21). The black lines on the bar of ECMWF S2S and FuXi-S2S forecasts represent the 25th and 75th percentiles. For the comparison of temporally averaged standardized $\textrm{TP}$ anomaly maps (b), the first column represents GPCP observations, while the second and third columns display predictions from ECMWF S2S and FuXi-S2S, respectively, both initialized on July 28th, and the fourth and fifth columns correspond to predictions from ECMWF S2S and FuXi-S2S, respectively, with an initialization date of July 21st. Green contour indicates the border line of Pakistan. The saliency maps (c) were generated using the gradient of the negative standardized $\textrm{TP}$ anomaly, averaged over the Pakistan region, in relation to the input $\textrm{SST}$. These maps correspond to forecasts initialized on July 28th (07-28, first column) and July 21st (07-21, second column). Here, the red and blue colors indicate the positive and negative correlations between the negative of standardized $\textrm{TP}$ and variations in $\textrm{SST}$. The black lines on the bars in Figure 4 represent the 25th and 75th percentiles of the ensemble forecasts for each start date for both ECMWF and FuXi-S2S models.
Figure 5: Schematic diagram of the structures of the FuXi Subseasonal-to-Seasonal (FuXi-S2S) model. $\textbf{a}$) Inference stage of the FuXi-S2S model. $\textrm{h}^t$ represents the hidden feature generated by the Encoder from the input data. The perturbation vector $\textrm{z}^t$ is generated by the perturbation module, resulting in the perturbed hidden feature $\tilde{\textrm{h}}^t$. $\textbf{b}$) Training stage of the FuXi-S2S model. ${\textrm{N($\Theta$}^t}_p)$ and ${\textrm{N($\Theta$}^t}_q)$ are the low-rank multivariate Gaussian distributions generated by encoders $\textrm{P}$ and $\textrm{Q}$, respectively. The Kullback–Leibler (KL) divergence loss measures the discrepancy between the distributions predicted by both encoders, ${\textrm{N($\Theta$}^t}_p)$ and ${\textrm{N($\Theta$}^t}_q)$.

FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

TL;DR

Abstract

FuXi-S2S: A machine learning model that outperforms conventional global subseasonal forecast models

Authors

TL;DR

Abstract

Table of Contents

Figures (5)