XiChen: An observation-scalable fully AI-driven global weather forecasting system with 4D variational knowledge
Wuxin Wang, Weicheng Ni, Lilan Huang, Tao Hao, Ben Fei, Shuo Ma, Taikang Yuan, Yanlai Zhao, Kefeng Deng, Xiaoyong Li, Hongze Leng, Boheng Duan, Lei Bai, Weimin Zhang, Kaijun Ren, Junqiang Song
TL;DR
XiChen introduces a fully AI-driven global weather forecasting system that scales assimilation across observation types by integrating a foundation model with 4DVar knowledge. The approach delivers a 15-second end-to-end pipeline on a single GPU and achieves 8.75+ day forecast lead times with accuracy comparable to operational NWP systems, while maintaining physical balance during DA. It demonstrates robust performance across a year-long DA cycle, 10-day forecasts, and tropical cyclone tracks, aided by a cascaded DA framework and selective fine-tuning of observation operators and DA components. While presenting a powerful complementary tool to NWP, XiChen acknowledges limitations such as smoothing tendencies, resolution constraints, and reliance on reanalysis data for training, outlining future directions toward higher resolution, more observational diversity, and probabilistic forecasting.
Abstract
Artificial intelligence (AI)-driven models have the potential to revolutionize weather forecasting, but still rely on initial conditions generated by costly Numerical Weather Prediction (NWP) systems. Although recent end-to-end forecasting models attempt to bypass NWP systems, these methods lack scalable assimilation of new types of observational data. Here, we introduce XiChen, an observation-scalable fully AI-driven global weather forecasting system, wherein the entire pipeline, from Data Assimilation (DA) to medium-range forecasting, can be accomplished within only 15 seconds. XiChen is built upon a foundation model that is pre-trained for weather forecasting and subsequently fine-tuned to serve as both observation operators and DA models, thereby enabling the scalable assimilation of conventional and raw satellite observations. Furthermore, the integration of Four-Dimensional Variational (4DVar) knowledge ensures XiChen to achieve DA and medium-range forecasting accuracy comparable to operational NWP systems, with skillful forecasting lead time beyond 8.75 days. A key feature of XiChen is its ability to maintain physical balance constraints during DA, enabling observed variables to correct unobserved ones effectively. In single-point perturbation DA experiments, XiChen exhibits flow-dependent characteristics similar to those of traditional 4DVar systems. These results demonstrate that XiChen holds strong potential for fully AI-driven weather forecasting independent of NWP systems.
