Table of Contents
Fetching ...

Ensemble Transport Filter via Optimized Maximum Mean Discrepancy

Dengfei Zeng, Lijian Jiang

TL;DR

The paper tackles nonlinear data assimilation in high-dimensional, non-Gaussian settings where traditional EnKF struggles and particle filters suffer from degeneracy. It introduces the Ensemble Transport Filter (EnTranF), which learns a transport map from the forecast prior to the posterior by minimizing a Maximum Mean Discrepancy ($MMD$) loss in an RKHS, and augments this with a variance penalty to emphasize informative statistics. A key advancement is the high-dimensional extension via observation splitting, enabling sequential, lower-dimensional Bayesian inversions that are composed to form the full update. The method is analyzed shows that in the linear-kernel limit it recovers an EnKF-like update, while in nonlinear settings it yields non-Gaussian posterior fidelity, demonstrated across static inverse problems and chaotic Lorenz systems. Overall, EnTranF provides a robust, scalable alternative to EnKF and PF for complex assimilation tasks, with demonstrated improvements in posterior accuracy, uncertainty quantification, and stability.

Abstract

In this paper, we present a new ensemble-based filter method by reconstructing the analysis step of the particle filter through a transport map, which directly transports prior particles to posterior particles. The transport map is constructed through an optimization problem described by the Maximum Mean Discrepancy loss function, which matches the expectation information of the approximated posterior and reference posterior. The proposed method inherits the accurate estimation of the posterior distribution from particle filtering while gives an extension to high dimensional assimilation problems. To improve the robustness of Maximum Mean Discrepancy, a variance penalty term is used to guide the optimization. It prioritizes minimizing the discrepancy between the expectations of highly informative statistics for the reference posteriors. The penalty term significantly enhances the robustness of the proposed method and leads to a better approximation of the posterior. A few numerical examples are presented to illustrate the advantage of the proposed method over ensemble Kalman filter.

Ensemble Transport Filter via Optimized Maximum Mean Discrepancy

TL;DR

The paper tackles nonlinear data assimilation in high-dimensional, non-Gaussian settings where traditional EnKF struggles and particle filters suffer from degeneracy. It introduces the Ensemble Transport Filter (EnTranF), which learns a transport map from the forecast prior to the posterior by minimizing a Maximum Mean Discrepancy () loss in an RKHS, and augments this with a variance penalty to emphasize informative statistics. A key advancement is the high-dimensional extension via observation splitting, enabling sequential, lower-dimensional Bayesian inversions that are composed to form the full update. The method is analyzed shows that in the linear-kernel limit it recovers an EnKF-like update, while in nonlinear settings it yields non-Gaussian posterior fidelity, demonstrated across static inverse problems and chaotic Lorenz systems. Overall, EnTranF provides a robust, scalable alternative to EnKF and PF for complex assimilation tasks, with demonstrated improvements in posterior accuracy, uncertainty quantification, and stability.

Abstract

In this paper, we present a new ensemble-based filter method by reconstructing the analysis step of the particle filter through a transport map, which directly transports prior particles to posterior particles. The transport map is constructed through an optimization problem described by the Maximum Mean Discrepancy loss function, which matches the expectation information of the approximated posterior and reference posterior. The proposed method inherits the accurate estimation of the posterior distribution from particle filtering while gives an extension to high dimensional assimilation problems. To improve the robustness of Maximum Mean Discrepancy, a variance penalty term is used to guide the optimization. It prioritizes minimizing the discrepancy between the expectations of highly informative statistics for the reference posteriors. The penalty term significantly enhances the robustness of the proposed method and leads to a better approximation of the posterior. A few numerical examples are presented to illustrate the advantage of the proposed method over ensemble Kalman filter.
Paper Structure (19 sections, 4 theorems, 55 equations, 20 figures, 1 table, 1 algorithm)

This paper contains 19 sections, 4 theorems, 55 equations, 20 figures, 1 table, 1 algorithm.

Key Result

Lemma 3.1

(Lemma 9.3.2 in richardmRealAnalysisProbability2018) Let $(\Omega, d)$ be a metric space, and $p, q$ be two probability measure defined on $\Omega$. Then $p=q$ if and only if $\mathbb{E}_{\boldsymbol{x}\sim p}[f(\boldsymbol{x})] = \mathbb{E}_{\boldsymbol{y}\sim q}[f(\boldsymbol{y})]$ for all $f\in \

Figures (20)

  • Figure 2.1: Directed probability graph of dynamical model \ref{['eq:dynamical model']} and observation operator \ref{['eq:observation operator']} in the form of state-space model
  • Figure 2.2: Propagation of particles between distributions in ensemble-based filtering
  • Figure 3.3: Schematic of observation split and composition of transport maps generated by each splited observation.
  • Figure 4.4: Posterior approximation of EnTranF with Linear transport map (Panel (a)-(c)) and Nonlinear transport map (Panel (d)-(f)) when taking different bandwidths ($10^{-3}$, $10^{0}$ and $10^{3}$).
  • Figure 4.5: 2-d Histogram of ensemble particles generated by EnKF, EnTranF and EnTranFp. Red "$\times$" represents the MAP estimation of the corresponding method.
  • ...and 15 more figures

Theorems & Definitions (9)

  • Lemma 3.1
  • Proposition 3.2
  • Lemma 3.3
  • Remark 3.1
  • Remark 3.2
  • Theorem 3.4
  • proof
  • Remark 3.3
  • Remark 3.4