Exact nonlinear state estimation
Hristo G. Chipilski
TL;DR
This work addresses biases in traditional data assimilation caused by Gaussian assumptions by introducing the Conjugate Transform Filter (CTF), a nonlinear state-space formulation that propagates Gaussian priors through invertible, learnable maps to produce non-Gaussian but tractable posteriors. An ensemble counterpart, ECTF, leverages an EnKF-like update in a latent Gaussian space, enabling principled, non-Gaussian updates while preserving Bayesian consistency. Theoretical results show that CTF generalizes the Kalman filter (and its affine variants) and yields exact nonlinear filtering in the transformed space, while numerical experiments demonstrate that ECTF outperforms the standard EnKF—especially when observations are accurate and state variables are strongly nonlinear or highly correlated. Overall, the framework provides a bridge between traditional DA and AI-based, nonparametric approaches, with practical implications for high-dimensional Earth system modeling and beyond.
Abstract
The majority of data assimilation (DA) methods in the geosciences are based on Gaussian assumptions. While these assumptions facilitate efficient algorithms, they cause analysis biases and subsequent forecast degradations. Non-parametric, particle-based DA algorithms have superior accuracy, but their application to high-dimensional models still poses operational challenges. Drawing inspiration from recent advances in the field of generative artificial intelligence (AI), this article introduces a new nonlinear estimation theory which attempts to bridge the existing gap in DA methodology. Specifically, a Conjugate Transform Filter (CTF) is derived and shown to generalize the celebrated Kalman filter to arbitrarily non-Gaussian distributions. The new filter has several desirable properties, such as its ability to preserve statistical relationships in the prior state and convergence to highly accurate observations. An ensemble approximation of the new theory (ECTF) is also presented and validated using idealized statistical experiments that feature bounded quantities with non-Gaussian distributions, a prevalent challenge in Earth system models. Results from these experiments indicate that the greatest benefits from ECTF occur when observation errors are small relative to the forecast uncertainty and when state variables exhibit strong nonlinear dependencies. Ultimately, the new filtering theory offers exciting avenues for improving conventional DA algorithms through their principled integration with AI techniques.
