Learning Enhanced Ensemble Filters

Eviatar Bach; Ricardo Baptista; Edoardo Calvello; Bohan Chen; Andrew Stuart

Learning Enhanced Ensemble Filters

Eviatar Bach, Ricardo Baptista, Edoardo Calvello, Bohan Chen, Andrew Stuart

TL;DR

The paper addresses the data-assimilation challenge of accurately inferring high-dimensional states under partial, noisy observations. It introduces measure neural mappings (MNM) and a transformer-based MNMEF framework that learn analysis maps acting on probability measures, enabling parameter sharing across ensemble sizes via a mean-field perspective. The approach extends EnKF by incorporating trainable corrections (gain, inflation, localization) learned through a set-transformer architecture, achieving superior performance on Lorenz96, Kuramoto–Sivashinsky, and Lorenz63 across ensemble sizes and demonstrating efficient fine-tuning. The work offers a principled neural-operator strategy for amortized data assimilation, with potential broad impact for scalable, robust filtering in nonlinear, high-dimensional systems.

Abstract

The filtering distribution in hidden Markov models evolves according to the law of a mean-field model in state-observation space. The ensemble Kalman filter (EnKF) approximates this mean-field model with an ensemble of interacting particles, employing a Gaussian ansatz for the joint distribution of the state and observation at each observation time. These methods are robust, but the Gaussian ansatz limits accuracy. Here this shortcoming is addressed by using machine learning to map the joint predicted state and observation to the updated state estimate. The derivation of methods from a mean field formulation of the true filtering distribution suggests a single parametrization of the algorithm that can be deployed at different ensemble sizes. And we use a mean field formulation of the ensemble Kalman filter as an inductive bias for our architecture. To develop this perspective, in which the mean-field limit of the algorithm and finite interacting ensemble particle approximations share a common set of parameters, a novel form of neural operator is introduced, taking probability distributions as input: a measure neural mapping (MNM). A MNM is used to design a novel approach to filtering, the MNM-enhanced ensemble filter (MNMEF), which is defined in both the mean-field limit and for interacting ensemble particle approximations. The ensemble approach uses empirical measures as input to the MNM and is implemented using the set transformer, which is invariant to ensemble permutation and allows for different ensemble sizes. In practice fine-tuning of a small number of parameters, for specific ensemble sizes, further enhances the accuracy of the scheme. The promise of the approach is demonstrated by its superior root-mean-square-error performance relative to leading methods in filtering the Lorenz '96 and Kuramoto-Sivashinsky models.

Learning Enhanced Ensemble Filters

TL;DR

Abstract

Learning Enhanced Ensemble Filters

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (37)