Weakly-Constrained 4D Var for Downscaling with Uncertainty using Data-Driven Surrogate Models
Philip Dinenis, Vishwas Rao, Mihai Anitescu
TL;DR
This work presents a framework that stabilizes fast data-driven downscaling with FourCastNet by embedding it in a weakly constrained $4DVar$ data assimilation scheme. It introduces a Gaussian model-error term, estimates a diagonal $\mathbf{Q}$, and uses LBFGS with automatic differentiation to recover high-resolution trajectories while providing posterior uncertainty through a Laplace approximation. The approach is tested on ERA5 hurricane data (Hurricane Michael), showing improved forecast accuracy and uncertainty quantification over EnKF and unassimilated FourCastNet, and it demonstrates the ability to recover fine-scale features and track extremes more reliably. The results suggest substantial potential for real-time, high-resolution weather forecasting and risk assessment, with avenues to extend to longer horizons, other surrogates, and regionally varying resolutions.
Abstract
Dynamic downscaling typically involves using numerical weather prediction (NWP) solvers to refine coarse data to higher spatial resolutions. Data-driven models such as FourCastNet have emerged as a promising alternative to the traditional NWP models for forecasting. Once these models are trained, they are capable of delivering forecasts in a few seconds, thousands of times faster compared to classical NWP models. However, as the lead times, and, therefore, their forecast window, increase, these models show instability in that they tend to diverge from reality. In this paper, we propose to use data assimilation approaches to stabilize them when used for downscaling tasks. Data assimilation uses information from three different sources, namely an imperfect computational model based on partial differential equations (PDE), from noisy observations, and from an uncertainty-reflecting prior. In this work, when carrying out dynamic downscaling, we replace the computationally expensive PDE-based NWP models with FourCastNet in a ``weak-constrained 4DVar framework" that accounts for the implied model errors. We demonstrate the efficacy of this approach for a hurricane-tracking problem; moreover, the 4DVar framework naturally allows the expression and quantification of uncertainty. We demonstrate, using ERA5 data, that our approach performs better than the ensemble Kalman filter (EnKF) and the unstabilized FourCastNet model, both in terms of forecast accuracy and forecast uncertainty.
