Table of Contents
Fetching ...

Nonlinear Bayesian Update via Ensemble Kernel Regression with Clustering and Subsampling

Yoonsang Lee

TL;DR

The paper tackles the limitation of ensemble Kalman filters under non-Gaussian priors and nonlinear measurements by introducing a nonlinear Bayesian update that separates the observed and unobserved components. It uses kernel density estimation to perform nonlinear regression of the unobserved state $u$ on the observed state $v$, evaluating the conditional mean $\mu_{u|v}(v)$ at $\hat{v}$ obtained from a Kalman denoising of the measurements, and enhances robustness with subsampling via the Mahalanobis distance and optional unsupervised clustering. The method provides a practical, nonparametric alternative to Gaussian mixtures or particle filters, with a simple posterior covariance treatment for $u$ and a complete algorithm that switches to the linear update when local data are insufficient. Across Lorenz-63, a PDE-constrained Darcy problem, and high-dimensional Lorenz-96 tests, the nonlinear update reduces estimation errors in strongly nonlinear regimes, particularly when subsampling and clustering are combined, highlighting its potential as a robust augmentation to ensemble-based inference in challenging settings.

Abstract

Nonlinear Bayesian update for a prior ensemble is proposed to extend traditional ensemble Kalman filtering to settings characterized by non-Gaussian priors and nonlinear measurement operators. In this framework, the observed component is first denoised via a standard Kalman update, while the unobserved component is estimated using a nonlinear regression approach based on kernel density estimation. The method incorporates a subsampling strategy to ensure stability and, when necessary, employs unsupervised clustering to refine the conditional estimate. Numerical experiments on Lorenz systems and a PDE-constrained inverse problem illustrate that the proposed nonlinear update can reduce estimation errors compared to standard linear updates, especially in highly nonlinear scenarios.

Nonlinear Bayesian Update via Ensemble Kernel Regression with Clustering and Subsampling

TL;DR

The paper tackles the limitation of ensemble Kalman filters under non-Gaussian priors and nonlinear measurements by introducing a nonlinear Bayesian update that separates the observed and unobserved components. It uses kernel density estimation to perform nonlinear regression of the unobserved state on the observed state , evaluating the conditional mean at obtained from a Kalman denoising of the measurements, and enhances robustness with subsampling via the Mahalanobis distance and optional unsupervised clustering. The method provides a practical, nonparametric alternative to Gaussian mixtures or particle filters, with a simple posterior covariance treatment for and a complete algorithm that switches to the linear update when local data are insufficient. Across Lorenz-63, a PDE-constrained Darcy problem, and high-dimensional Lorenz-96 tests, the nonlinear update reduces estimation errors in strongly nonlinear regimes, particularly when subsampling and clustering are combined, highlighting its potential as a robust augmentation to ensemble-based inference in challenging settings.

Abstract

Nonlinear Bayesian update for a prior ensemble is proposed to extend traditional ensemble Kalman filtering to settings characterized by non-Gaussian priors and nonlinear measurement operators. In this framework, the observed component is first denoised via a standard Kalman update, while the unobserved component is estimated using a nonlinear regression approach based on kernel density estimation. The method incorporates a subsampling strategy to ensure stability and, when necessary, employs unsupervised clustering to refine the conditional estimate. Numerical experiments on Lorenz systems and a PDE-constrained inverse problem illustrate that the proposed nonlinear update can reduce estimation errors compared to standard linear updates, especially in highly nonlinear scenarios.

Paper Structure

This paper contains 15 sections, 25 equations, 4 figures, 2 tables, 1 algorithm.

Figures (4)

  • Figure 1: Time series of the posterior errors of $x$, $y$, and $z$.
  • Figure 2: Side-by-side comparison of Darcy flow results.
  • Figure 3: estimation error over iteration
  • Figure 4: Normalized statistical distributions of the posterior error time series for regimes with $F=6$ (left) and $F=8$ (right).