Ensemble-localized Kernel Density Estimation with Applications to the Ensemble Gaussian Mixture Filter
Andrey A. Popov, Enrico M. Zucchelli, Renato Zanetti
TL;DR
This work addresses non-Gaussian state estimation by introducing E-localization for kernel density estimation (ELKDE) and applying it to the ensemble Gaussian mixture filter to form the ELEnGMF. ELKDE computes local covariance structures around each particle, recovering the canonical KDE behavior in the Gaussian case while better capturing local density in non-Gaussian settings, thereby improving both prior and posterior estimates. The authors demonstrate strong improvements on a non-Gaussian bivariate spiral and show reduced RMSE and less conservative uncertainty in Lorenz '63 sequential filtering, indicating practical gains for online data assimilation. The approach offers a scalable density-estimation enhancement for nonlinear, non-Gaussian systems with limited samples, with future work on adaptive localization and robust projection strategies.
Abstract
The ensemble Gaussian mixture filter (EnGMF) is a non-linear filter suited to data assimilation of highly non-Gaussian and non-linear models that has practical utility in the case of a small number of samples, and theoretical convergence to full Bayesian inference in the ensemble limit. We aim to increase the utility of the EnGMF by introducing an ensemble-local notion of covariance into the kernel density estimation (KDE) step for the prior distribution. We prove that in the Gaussian case, our new ensemble-localized KDE technique is exactly the same as more traditional KDE techniques. We also show an example of a non-Gaussian distribution that can fail to be approximated by canonical KDE methods, but can be approximated well by our new KDE technique. We showcase our new KDE technique on a simple bivariate problem, showing that it has nice qualitative and quantitative properties, and significantly improves the estimate of the prior and posterior distributions for all ensemble sizes tested. We additionally show the utility of the proposed methodology for sequential filtering for the Lorenz '63 equations, achieving a significant reduction in error, and less conservative behavior in the uncertainty estimate with respect to traditional techniques.
