Table of Contents
Fetching ...

Assimilative Causal Inference

Marios Andreou, Nan Chen, Erik Bollt

TL;DR

The paper tackles the challenge of detecting instantaneous, time-varying causal relationships in complex dynamical systems from limited observations. It introduces Assimilative Causal Inference (ACI), which recasts causality as a Bayesian inverse problem by comparing smoother and filter posteriors and using relative entropy to declare instantaneous links. ACI further defines the Causal Influence Range (CIR) to quantify how far effects propagate in time, with objective CIR implemented via averaging over thresholds; online smoothing enables real time tracking and extends to conditional ACI to handle non-target variables. Demonstrations on nonlinear CGNS models, including a dyad with extreme events and an ENSO diversity model, show how CIR patterns reveal early triggering, indirect pathways, and regime-dependent causal structure. The framework is scalable to high dimensions, accommodates short time series, and preserves dynamical integrity, offering broad potential for climate science, neuroscience, and other complex systems.

Abstract

Causal inference determines cause-and-effect relationships between variables and has broad applications across disciplines. Traditional time-series methods often reveal causal links only in a time-averaged sense, while ensemble-based information transfer approaches detect the time evolution of short-term causal relationships but are typically limited to low-dimensional systems. In this paper, a new causal inference framework, called assimilative causal inference (ACI), is developed. Fundamentally different from the state-of-the-art methods, ACI uses a dynamical system and a single realization of a subset of the state variables to identify instantaneous causal relationships and the dynamic evolution of the associated causal influence range (CIR). Instead of quantifying how causes influence effects as done traditionally, ACI solves an inverse problem via Bayesian data assimilation, thus tracing causes backward from observed effects with an implicit Bayesian hypothesis. Causality is determined by assessing whether incorporating the information of the effect variables reduces the uncertainty in recovering the potential cause variables. ACI has several desirable features. First, it captures the dynamic interplay of variables, where their roles as causes and effects can shift repeatedly over time. Second, a mathematically justified objective criterion determines the CIR without empirical thresholds. Third, ACI is scalable to high-dimensional problems by leveraging computationally efficient Bayesian data assimilation techniques. Finally, ACI applies to short time series and incomplete datasets. Notably, ACI does not require observations of candidate causes, which is a key advantage since potential drivers are often unknown or unmeasured. The effectiveness of ACI is demonstrated by complex dynamical systems showcasing intermittency and extreme events.

Assimilative Causal Inference

TL;DR

The paper tackles the challenge of detecting instantaneous, time-varying causal relationships in complex dynamical systems from limited observations. It introduces Assimilative Causal Inference (ACI), which recasts causality as a Bayesian inverse problem by comparing smoother and filter posteriors and using relative entropy to declare instantaneous links. ACI further defines the Causal Influence Range (CIR) to quantify how far effects propagate in time, with objective CIR implemented via averaging over thresholds; online smoothing enables real time tracking and extends to conditional ACI to handle non-target variables. Demonstrations on nonlinear CGNS models, including a dyad with extreme events and an ENSO diversity model, show how CIR patterns reveal early triggering, indirect pathways, and regime-dependent causal structure. The framework is scalable to high dimensions, accommodates short time series, and preserves dynamical integrity, offering broad potential for climate science, neuroscience, and other complex systems.

Abstract

Causal inference determines cause-and-effect relationships between variables and has broad applications across disciplines. Traditional time-series methods often reveal causal links only in a time-averaged sense, while ensemble-based information transfer approaches detect the time evolution of short-term causal relationships but are typically limited to low-dimensional systems. In this paper, a new causal inference framework, called assimilative causal inference (ACI), is developed. Fundamentally different from the state-of-the-art methods, ACI uses a dynamical system and a single realization of a subset of the state variables to identify instantaneous causal relationships and the dynamic evolution of the associated causal influence range (CIR). Instead of quantifying how causes influence effects as done traditionally, ACI solves an inverse problem via Bayesian data assimilation, thus tracing causes backward from observed effects with an implicit Bayesian hypothesis. Causality is determined by assessing whether incorporating the information of the effect variables reduces the uncertainty in recovering the potential cause variables. ACI has several desirable features. First, it captures the dynamic interplay of variables, where their roles as causes and effects can shift repeatedly over time. Second, a mathematically justified objective criterion determines the CIR without empirical thresholds. Third, ACI is scalable to high-dimensional problems by leveraging computationally efficient Bayesian data assimilation techniques. Finally, ACI applies to short time series and incomplete datasets. Notably, ACI does not require observations of candidate causes, which is a key advantage since potential drivers are often unknown or unmeasured. The effectiveness of ACI is demonstrated by complex dynamical systems showcasing intermittency and extreme events.

Paper Structure

This paper contains 48 sections, 6 theorems, 93 equations, 12 figures.

Key Result

Theorem A.1

Assume that $\exists \mathcal{M}(t):=\lVert \hat{\delta}(\cdot;t)\rVert_{L^\infty(\mathrm{I})}=\lVert \delta(\cdot;t)\rVert_{L^\infty(\mathrm{[t,T]})}>0$ for each $t\in[0,T]$. Then, for $t\in[0,T]$: where $\hat{\delta}(\tau;t)$, $\mathrm{J}_t(\varepsilon)$, and $\overset{\sim}{\uptau}_{\mathbf{y}(t)\overset{{\text{\normalfont{ACI}}}}{\longrightarrow}\mathbf{x}}(\varepsilon)$ are defined in eq:std

Figures (12)

  • Figure 2.1: The assimilative causal inference (ACI) framework. Panel (a): A high-level overview of the method. Panel (b): Schematic illustration of the ACI from a more technical viewpoint. Panel (c): ACI in the presence of non-target variables. Panel (d): Objective CIR, as an integration of the associated subjective CIRs.
  • Figure 4.1: Data assimilation of the dyad model \ref{['eq:Dyad_model_SI']}. Panel (a): A single realization of the observed variable $x$. Panel (b): The true hidden signal $y$ (blue) alongside the posterior mean estimates from filtering (green) and smoothing (red), where the smoother is the complete smoother using all the information in future. The dashed line marks the anti-damping threshold, above which the net damping $-d_x + \gamma y$ in \ref{['eq:Dyad_model_x_SI']} becomes positive. Panel (c): Posterior variance of the filtered and smoothed estimates of $y$.
  • Figure 4.2: ACI values and CIRs for the nonlinear dyad model \ref{['eq:Dyad_model_SI']} from $y$ to $x$ as functions of time. Panel (a): Time series of $x$ (magenta) and $y$ (blue), with the objective CIR depicted as whiskers extending forward in time from each $y(t)$. The dashed horizontal line marks the anti-damping threshold $d_x/\gamma$. Panel (b): Subjective CIR (shaded region, logarithmically scaled) as a function of the threshold $\epsilon$ (logarithmic, reversed $y$-axis). Panel (c): ACI values from $y$ to $x$ over time.
  • Figure 4.3: Data assimilation of the noisy predator-prey model \ref{['eq:predator_prey']}. Panels (a)--(c) and (d)--(f) show the results by observing $y$ (recovering $x$) and observing $x$ (recovering $y$), respectively. The dashed lines in Panels (b) and (e) indicate the anti-damping threshold values.
  • Figure 4.4: ACI values and CIRs for the noisy predator-prey model \ref{['eq:predator_prey']}. Only the objective CIRs are shown. Panels (a)--(b) and (c)--(d) show the results from $x(t)$ to $y$ and from $y(t)$ to $x$, respectively. The dashed lines in Panels (a) and (b) indicate the anti-damping threshold values in the equations of $y$ and $x$, respectively.
  • ...and 7 more figures

Theorems & Definitions (11)

  • Theorem A.1: Computing the Objective CIR
  • proof : Proof of Theorem \ref{['thm:obj_subj_CIR_connection']}
  • Theorem B.1: Optimal nonlinear filter state estimation equations for CGNSs liptser2001statistics
  • Theorem B.2: Optimal nonlinear smoother state estimation backward equations for CGNSs liptser2001statistics
  • Theorem B.3: Optimal online smoother for CGNSs andreou2024adaptive
  • Theorem C.1: Principle of Nil Assimilative Causality for CGNSs
  • proof : Proof of Theorem \ref{['thm:nilcausality']}
  • Theorem C.2: Principle of Nil Conditional Assimilative Causality for CGNSs
  • proof : Proof of Theorem \ref{['thm:nilcondcausality']}
  • Remark C.1: Analytical Meaning of $\mathrm{Var}(\mathbf{x}_{\text{\normalfont{B}}}(t))\to+\infty$
  • ...and 1 more