Mitigating Time Discretization Challenges with WeatherODE: A Sandwich Physics-Driven Neural ODE for Weather Forecasting
Peiyuan Liu, Tian Zhou, Liang Sun, Rong Jin
TL;DR
WeatherODE tackles time-discretization errors and evolving source discrepancies in weather forecasting by embedding physics into a one-stage neural ODE. It couples a wave-equation–informed velocity model, a slower-converging ViT-based advection ODE, and a time-dependent source model within a CNN–ViT–CNN sandwich, trained with multi-task supervision across intermediate steps. The approach yields substantial RMSE gains over state-of-the-art baselines on global and regional ERA5 forecasts and demonstrates flexible inference with a single 24-hour model. By combining physical principles with tailored architectural biases, WeatherODE offers a scalable, accurate framework for hybrid weather prediction with improved stability and generalization.
Abstract
In the field of weather forecasting, traditional models often grapple with discretization errors and time-dependent source discrepancies, which limit their predictive performance. In this paper, we present WeatherODE, a novel one-stage, physics-driven ordinary differential equation (ODE) model designed to enhance weather forecasting accuracy. By leveraging wave equation theory and integrating a time-dependent source model, WeatherODE effectively addresses the challenges associated with time-discretization error and dynamic atmospheric processes. Moreover, we design a CNN-ViT-CNN sandwich structure, facilitating efficient learning dynamics tailored for distinct yet interrelated tasks with varying optimization biases in advection equation estimation. Through rigorous experiments, WeatherODE demonstrates superior performance in both global and regional weather forecasting tasks, outperforming recent state-of-the-art approaches by significant margins of over 40.0\% and 31.8\% in root mean square error (RMSE), respectively. The source code is available at \url{https://github.com/DAMO-DI-ML/WeatherODE}.
