Dropout Ensemble Kalman inversion for high dimensional inverse problems

Shuigen Liu; Sebastian Reich; Xin T. Tong

Dropout Ensemble Kalman inversion for high dimensional inverse problems

Shuigen Liu, Sebastian Reich, Xin T. Tong

TL;DR

The paper introduces dropout ensemble Kalman inversion (DEKI) to overcome the subspace limitation of standard EKI in high-dimensional inverse problems. By employing dropout on ensemble deviations and a mean–deviation separated dynamics with adaptive steps and a truncated linearization, DEKI achieves exponential convergence under suitable conditions and scales linearly with the problem dimension, addressing computational bottlenecks in large-scale settings. A rigorous analysis establishes ensemble collapse control, Gauss–Newton–like convergence, and near-optimal zeroth-order query complexity; numerical tests on linear transport and Darcy’s law demonstrate robust performance with small ensembles where vanilla EKI fails. The approach offers a practical, scalable framework for derivative-free inversion with strong theoretical guarantees and broad applicability in high-dimensional PDE-constrained problems.

Abstract

Ensemble Kalman inversion (EKI) is an ensemble-based method to solve inverse problems. Its gradient-free formulation makes it an attractive tool for problems with involved formulation. However, EKI suffers from the ''subspace property'', i.e., the EKI solutions are confined in the subspace spanned by the initial ensemble. It implies that the ensemble size should be larger than the problem dimension to ensure EKI's convergence to the correct solution. Such scaling of ensemble size is impractical and prevents the use of EKI in high dimensional problems. To address this issue, we propose a novel approach using dropout regularization to mitigate the subspace problem. We prove that dropout-EKI converges in the small ensemble settings, and the computational cost of the algorithm scales linearly with dimension. We also show that dropout-EKI reaches the optimal query complexity, up to a constant factor. Numerical examples demonstrate the effectiveness of our approach.

Dropout Ensemble Kalman inversion for high dimensional inverse problems

TL;DR

Abstract

Paper Structure (28 sections, 11 theorems, 159 equations, 4 figures, 1 table, 1 algorithm)

This paper contains 28 sections, 11 theorems, 159 equations, 4 figures, 1 table, 1 algorithm.

Introduction
Notations
Dropout EKI scheme
Review: EKI and subspace property
Naive DEKI
DEKI with mean--deviation separation
Related literature
EKI methodology
Dropout in deep learning
Localized EKI
Analysis of DEKI
Assumptions
Controllable ensemble collapse
Convergence to optimal solution
Query complexity analysis
...and 13 more sections

Key Result

Proposition 3.3

\newlabelprop:ensemble_collapse0 Consider the DEKI scheme for the regularized problem eq:RegProblem. Assume that the linearized map $H_n$ satisfies $\gamma^2 I \preceq H_n^{\rm T} H_n \preceq M^2 I$. Denote $C_n^{uu}$ as the empirical covariance. Choose adaptive step sizes $h_n= \theta \left\|C_n where $\lambda_k(C_n^{uu})$ is the $k$-th largest eigenvalues of $C_n^{uu}$ and $r$ is the rank of $

Figures (4)

Figure 1: Comparison of DEKI with vanilla EKI and LEKI in linear transport equation model with $d_u = 120, J = 20$. Left: decay of the relative data misfit \ref{['eq:RelaMisfit']}, here LEKI w/o stands for LEKI without prior knowledge of the transport speed. Right: collapse of the ensemble, the solid line represents $\max_{s} C_n^{uu}(s,s)$ and the dashed line represents $\min_{s} C_n^{uu}(s,s)$.
Figure 2: Linear dependence of the convergence rate \ref{['eq:ConvRate']} of DEKI on reciprocal of dimension $d_u^{-1}$ and ensemble size $J$, tested in linear transport equation model.
Figure 3: Comparison of DEKI and EKI in Darcy's law model in setup 1 ($d_u=136$) using ensemble size $J=15$. Left: relative data misfit \ref{['eq:RelaMisfit']}; right: relative solution error \ref{['eq:RelaError']}.
Figure 4: Comparison of the inversion solutions of EKI and DEKI in Darcy's law model in different setups with ensemble size $J=15$. Row 1: setup 1 $(l_x = 0.1, l_y = 0.1, d_u=136)$; Row 2: setup 2 $(l_x = 0.2, l_y = 0.05, d_u=142)$; Row 3: setup 3 $(l_x = 0.15, l_y = 0.05, d_u=179)$; Row 4: setup 4 $(l_x = 0.1, l_y = 0.05, d_u=254)$.

Theorems & Definitions (19)

Proposition 3.3
Proof 1
Proposition 3.4
Remark 3.5
Proof 2
Proposition 3.6
Theorem 3.7
Lemma 3.9
Proof 3: Proof for \ref{['thm:convergence']}
Lemma 3.10
...and 9 more

Dropout Ensemble Kalman inversion for high dimensional inverse problems

TL;DR

Abstract

Dropout Ensemble Kalman inversion for high dimensional inverse problems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (19)