Model Error Covariance Estimation for Weak Constraint Data Assimilation
Sandra R. Babyale, Jodi Mead, Donna Calhoun, Patricia O. Azike
TL;DR
This work addresses the challenge of estimating model-error covariances in weakly constrained 4D-Var data assimilation by recasting the problem as a regularized inverse problem in which the inverse model-error covariance acts as the regularization operator. The authors employ the representer method to reduce the state-space search to data-space, enabling efficient use of standard regularization-parameter selection techniques (L-curve, generalized cross-validation, and chi-square) to estimate hyperparameters for both isotropic and non-isotropic covariances. Through a series of 1D wildfire PM$_{2.5}$ transport experiments, they show that isotropic covariances suffice when the first guess is more reliable, while non-isotropic, spatiotemporally correlated covariances offer clear improvements when observations are more trustworthy. The approach yields consistent hyperparameter estimates and practical computational costs, suggesting potential extensions to jointly estimate other covariances (e.g., initial and boundary conditions) and to higher-resolution models for real-time applications.
Abstract
State estimates from weak constraint 4D-Var data assimilation can vary significantly depending on the data and model error covariances. As a result, the accuracy of these estimates heavily depends on the correct specification of both model and observational data error covariances. In this work, we assume that the data error is known and and focus on estimating the model error covariance by framing weak constraint 4D-Var as a regularized inverse problem, where the inverse model error covariance serves as the regularization matrix. We consider both isotropic and non-isotropic forms of the model error covariance. Using the representer method, we reduce the 4D-Var problem from state space to data space, enabling the efficient application of regularization parameter selection techniques. The Representer method also provides an analytic expression for the optimal state estimate, allowing us to derive matrix expressions for the three regularization parameter selection methods i.e. the L-curve, generalized cross-validation (GCV), and the Chi-square method. We validate our approach by assimilating simulated data into a 1D transport equation modeling wildfire smoke transport under various observational noise and forward model perturbations. In these experiments the goal is to identify the model error covariances that accurately capture the influence of observational data versus model predictions on assimilated state estimates. The regularization parameter selection methods successfully estimate hyperparameters for both isotropic and non-isotropic model error covariances, that reflect whether the first guess model predictions are more or less reliable than the observational data. The results further indicate that isotropic variances are sufficient when the first guess is more accurate than the data whereas non-isotropic covariances are preferred when the observational data is more reliable.
