Table of Contents
Fetching ...

High-dimensional Bayesian filtering through deep density approximation

Kasper Bågmark, Filip Rydin

TL;DR

This work tackles high-dimensional nonlinear state estimation by recasting nonlinear filtering through the Fokker–Planck equation with Bayes updates and comparing two deep density methods, the Deep Splitting Filter (DSF) and the Deep BSDE Filter (BSDEF), along with their log-density variants. The authors derive log-density formulations to stabilize training and demonstrate a unified simulation-based framework using Euler–Maruyama discretization and importance-sampling normalization, benchmarking these methods against EKF, EnKF, and bootstrap PF across a spectrum of problems up to 100 dimensions. They show that LogBSDEF remains accurate and robust in challenging regimes, including chaotic Lorenz-96 at $d=100$, while particle methods fail or become prohibitively expensive; in linear settings, the deep density methods achieve competitive accuracy, and across nonlinear cases they maintain numerical stability with substantial computational speedups (roughly two to five orders of magnitude) over particle-based filters. The study highlights practical advantages for high-dimensional data assimilation, with clear guidance on when log-density formulations and BSDE-based training offer the most benefit, and discusses limitations related to architecture choice and training stability. Overall, the paper advances scalable, density-based nonlinear filtering by delivering accurate, fast, and robust deep learning-based alternatives to classical methods in high dimensions.

Abstract

In this work, we benchmark two recently developed deep density methods for nonlinear filtering. Starting from the Fokker--Planck equation with Bayes updates, we model the filtering density of a discretely observed SDE. The two filters: the deep splitting filter and the deep BSDE filter, are both based on Feynman--Kac formulas, Euler--Maruyama discretizations and neural networks. The two methods are extended to logarithmic formulations providing sound and robust implementations in increasing state dimension. Comparing to the classical particle filters and ensemble Kalman filters, we benchmark the methods on numerous examples. In the low-dimensional examples the particle filters work well, but when we scale up to a partially observed 100-dimensional Lorenz-96 model the particle-based methods fail and the logarithmic deep density method prevails. In terms of computational efficiency, the deep density methods reduce inference time by roughly two to five orders of magnitude relative to the particle-based filters.

High-dimensional Bayesian filtering through deep density approximation

TL;DR

This work tackles high-dimensional nonlinear state estimation by recasting nonlinear filtering through the Fokker–Planck equation with Bayes updates and comparing two deep density methods, the Deep Splitting Filter (DSF) and the Deep BSDE Filter (BSDEF), along with their log-density variants. The authors derive log-density formulations to stabilize training and demonstrate a unified simulation-based framework using Euler–Maruyama discretization and importance-sampling normalization, benchmarking these methods against EKF, EnKF, and bootstrap PF across a spectrum of problems up to 100 dimensions. They show that LogBSDEF remains accurate and robust in challenging regimes, including chaotic Lorenz-96 at , while particle methods fail or become prohibitively expensive; in linear settings, the deep density methods achieve competitive accuracy, and across nonlinear cases they maintain numerical stability with substantial computational speedups (roughly two to five orders of magnitude) over particle-based filters. The study highlights practical advantages for high-dimensional data assimilation, with clear guidance on when log-density formulations and BSDE-based training offer the most benefit, and discusses limitations related to architecture choice and training stability. Overall, the paper advances scalable, density-based nonlinear filtering by delivering accurate, fast, and robust deep learning-based alternatives to classical methods in high dimensions.

Abstract

In this work, we benchmark two recently developed deep density methods for nonlinear filtering. Starting from the Fokker--Planck equation with Bayes updates, we model the filtering density of a discretely observed SDE. The two filters: the deep splitting filter and the deep BSDE filter, are both based on Feynman--Kac formulas, Euler--Maruyama discretizations and neural networks. The two methods are extended to logarithmic formulations providing sound and robust implementations in increasing state dimension. Comparing to the classical particle filters and ensemble Kalman filters, we benchmark the methods on numerous examples. In the low-dimensional examples the particle filters work well, but when we scale up to a partially observed 100-dimensional Lorenz-96 model the particle-based methods fail and the logarithmic deep density method prevails. In terms of computational efficiency, the deep density methods reduce inference time by roughly two to five orders of magnitude relative to the particle-based filters.

Paper Structure

This paper contains 32 sections, 1 theorem, 48 equations, 14 figures, 2 tables.

Key Result

Theorem 2.1

Let $p$ be the solution to eq: FP-prediction--eq: FP-update. Define $v=(v_k)_{k=0}^K$, $v_k\colon [t_k,t_{k+1}]\times \mathbb{R}^d\times \mathbb{O}_k\to \mathbb{R}$, such that $v_k = -\log p_k$, for $k=0,\dots,K$. The log density $v$, satisfying $v_{0}(0)=-\log(\pi_0)$, is, for $k=0,\dots,K-1$, $x\i where $f_{\mathrm{log}}$ is defined, for $x\in\mathbb{R}^d$, $u\in\mathbb{R}$, and $w\in\mathbb{R}^

Figures (14)

  • Figure 1: On the left and right panels the results for the Ornstein--Uhlenbeck process and the bistable process are depicted, respectively. From top to bottom the rMAE, FME, and KLD metrics are illustrated.
  • Figure 2: On the left and right panels the results for the long-horizon $10$-dimensional Ornstein--Uhlenbeck process and the short-horizon $100$-dimensional Ornstein--Uhlenbeck process are depicted respectively. From top to bottom the rMAE, FME, and KLD metrics are illustrated.
  • Figure 3: On the left and right panels the results for the $10$-dimensional and $100$-dimensional linear spring-mass models are depicted respectively. From top to bottom the rMAE, FME, and KLD metrics are illustrated.
  • Figure 4: Metrics for the Schlögl model, shown left to right: rMAE, FME, and KLD.
  • Figure 5: Metrics for the four-dimensional Lorenz-96 model, shown left to right: rMAE, FME, and KLD.
  • ...and 9 more figures

Theorems & Definitions (1)

  • Theorem 2.1