Ensemble Kalman filter in latent space using a variational autoencoder pair
Ivo Pasmans, Yumeng Chen, Tobias Sebastian Finn, Marc Bocquet, Alberto Carrassi
TL;DR
This work addresses the challenge of non-Gaussian errors and constrained dynamics in data assimilation by performing ETKF updates in the latent space of variational autoencoders. A single-VAE approach confines ensemble members to a physically meaningful manifold, while a double-VAE variant targets observational innovations to mitigate non-Gaussian bias. Online retraining (transfer learning) of the first VAE is shown to be essential when the underlying manifold changes over time, and the second VAE provides robustness to non-Gaussian observation errors, particularly under strong skewness. Overall, latent-space ETKF-VAEs improve distributional fidelity and physical consistency compared to standard ETKF, with practical implications for complex geophysical systems such as sea-ice models.
Abstract
Popular (ensemble) Kalman filter data assimilation (DA) approaches assume that the errors in both the a priori estimate of the state and those in the observations are Gaussian. For constrained variables, e.g. sea ice concentration or stress, such an assumption does not hold. The variational autoencoder (VAE) is a machine learning (ML) technique that allows to map an arbitrary distribution to/from a latent space in which the distribution is supposedly closer to a Gaussian. We propose a novel hybrid DA-ML approach in which VAEs are incorporated in the DA procedure. Specifically, we introduce a variant of the popular ensemble transform Kalman filter (ETKF) in which the analysis is applied in the latent space of a single VAE or a pair of VAEs. In twin experiments with a simple circular model, whereby the circle represents an underlying submanifold to be respected, we find that the use of a VAE ensures that a posteri ensemble members lie close to the manifold containing the truth. Furthermore, online updating of the VAE is necessary and achievable when this manifold varies in time, i.e. when it is non-stationary. We demonstrate that introducing an additional second latent space for the observational innovations improves robustness against detrimental effects of non-Gaussianity and bias in the observational errors but it slightly lessens the performance if observational errors are strictly Gaussian.
