VAE-Var: Variational-Autoencoder-Enhanced Variational Assimilation
Yi Xiao, Qilong Jia, Wei Xue, Lei Bai
TL;DR
This work tackles the Gaussian-background-error limitation in variational data assimilation by introducing VAE-Var, which learns a non-Gaussian background-error distribution via a variational autoencoder. The authors derive a tractable variational cost in latent space that includes a regularization term and a Jacobian-determinant term to account for the nonlinear mapping between latent and physical spaces. They demonstrate consistent accuracy gains over traditional 3DVar and 4DVar on low-dimensional chaotic systems (Lorenz 63 and Lorenz 96) under both linear and nonlinear observation operators, with ablations confirming the importance of each cost component. The approach shows promise for improved assimilation accuracy, though scalability to high-dimensional systems remains a key limitation for future work.
Abstract
Data assimilation refers to a set of algorithms designed to compute the optimal estimate of a system's state by refining the prior prediction (known as background states) using observed data. Variational assimilation methods rely on the maximum likelihood approach to formulate a variational cost, with the optimal state estimate derived by minimizing this cost. Although traditional variational methods have achieved great success and have been widely used in many numerical weather prediction centers, they generally assume Gaussian errors in the background states, which limits the accuracy of these algorithms due to the inherent inaccuracies of this assumption. In this paper, we introduce VAE-Var, a novel variational algorithm that leverages a variational autoencoder (VAE) to model a non-Gaussian estimate of the background error distribution. We theoretically derive the variational cost under the VAE estimation and present the general formulation of VAE-Var; we implement VAE-Var on low-dimensional chaotic systems and demonstrate through experimental results that VAE-Var consistently outperforms traditional variational assimilation methods in terms of accuracy across various observational settings.
