Table of Contents
Fetching ...

Physics as the Inductive Bias for Causal Discovery

Jianhong Chen, Naichen Shi, Xubo Yue

TL;DR

This work addresses causal discovery in dynamical systems by embedding partial physical knowledge into a stochastic differential equation framework, where the drift g(t,X_t,γ) encodes known physics and the diffusion S_A(X_t) captures unknown causal couplings. It introduces SCD, a scalable gradient-based estimator that learns the causal graph from trajectory data via an Euler–Maruyama Gaussian quasi-likelihood with an ℓ1 penalty to promote sparsity; theory shows high-probability recovery of the true graph under mild Lyapunov-stability conditions, without requiring acyclicity or additive noise. Empirical results on both DAGs and cyclic (loop) graphs demonstrate superior graph-recovery performance (lower SHD, higher TPR, lower FDR) and greater physical consistency than state-of-the-art baselines, including in nonstationary settings. The approach offers a principled route to physics-informed causal reasoning in complex dynamical systems and suggests several avenues for extending causal discovery to richer mechanistic forms and spatiotemporal data.

Abstract

Causal discovery is often a data-driven paradigm to analyze complex real-world systems. In parallel, physics-based models such as ordinary differential equations (ODEs) provide mechanistic structure for many dynamical processes. Integrating these paradigms potentially allows physical knowledge to act as an inductive bias, improving identifiability, stability, and robustness of causal discovery in dynamical systems. However, such integration remains challenging: real dynamical systems often exhibit feedback, cyclic interactions, and non-stationary data trend, while many widely used causal discovery methods are formulated under acyclicity or equilibrium-based assumptions. In this work, we propose an integrative causal discovery framework for dynamical systems that leverages partial physical knowledge as an inductive bias. Specifically, we model system evolution as a stochastic differential equation (SDE), where the drift term encodes known ODE dynamics and the diffusion term corresponds to unknown causal couplings beyond the prescribed physics. We develop a scalable sparsity-inducing MLE algorithm that exploits causal graph structure for efficient parameter estimation. Under mild conditions, we establish guarantees to recover the causal graph. Experiments on dynamical systems with diverse causal structures show that our approach improves causal graph recovery and produces more stable, physically consistent estimates than purely data-driven state-of-the-art baselines.

Physics as the Inductive Bias for Causal Discovery

TL;DR

This work addresses causal discovery in dynamical systems by embedding partial physical knowledge into a stochastic differential equation framework, where the drift g(t,X_t,γ) encodes known physics and the diffusion S_A(X_t) captures unknown causal couplings. It introduces SCD, a scalable gradient-based estimator that learns the causal graph from trajectory data via an Euler–Maruyama Gaussian quasi-likelihood with an ℓ1 penalty to promote sparsity; theory shows high-probability recovery of the true graph under mild Lyapunov-stability conditions, without requiring acyclicity or additive noise. Empirical results on both DAGs and cyclic (loop) graphs demonstrate superior graph-recovery performance (lower SHD, higher TPR, lower FDR) and greater physical consistency than state-of-the-art baselines, including in nonstationary settings. The approach offers a principled route to physics-informed causal reasoning in complex dynamical systems and suggests several avenues for extending causal discovery to richer mechanistic forms and spatiotemporal data.

Abstract

Causal discovery is often a data-driven paradigm to analyze complex real-world systems. In parallel, physics-based models such as ordinary differential equations (ODEs) provide mechanistic structure for many dynamical processes. Integrating these paradigms potentially allows physical knowledge to act as an inductive bias, improving identifiability, stability, and robustness of causal discovery in dynamical systems. However, such integration remains challenging: real dynamical systems often exhibit feedback, cyclic interactions, and non-stationary data trend, while many widely used causal discovery methods are formulated under acyclicity or equilibrium-based assumptions. In this work, we propose an integrative causal discovery framework for dynamical systems that leverages partial physical knowledge as an inductive bias. Specifically, we model system evolution as a stochastic differential equation (SDE), where the drift term encodes known ODE dynamics and the diffusion term corresponds to unknown causal couplings beyond the prescribed physics. We develop a scalable sparsity-inducing MLE algorithm that exploits causal graph structure for efficient parameter estimation. Under mild conditions, we establish guarantees to recover the causal graph. Experiments on dynamical systems with diverse causal structures show that our approach improves causal graph recovery and produces more stable, physically consistent estimates than purely data-driven state-of-the-art baselines.
Paper Structure (38 sections, 10 theorems, 148 equations, 5 figures, 6 tables)

This paper contains 38 sections, 10 theorems, 148 equations, 5 figures, 6 tables.

Key Result

Proposition 1

Let the SDE satisfy the assumptions from §SDE_gen and $T<\infty$ (i.e. locally Lipschitz, linear growth condition, and the square-integrable initial condition). Then, the SDE defined in Eq.GSDE, admits a unique strong solution with continuous sample paths. If Assumption.ass:stable holds, there exist

Figures (5)

  • Figure 1: A motivating example of causal discovery under partially known physics. (A) Top: the system structure with physics available for a subset of variables, together with the corresponding adjacency graph. Bottom: simulated time-series data generated from an SDE that incorporates uncertainty-aware unmodeled couplings for the given graph. (B) Top: results (shown as heatmaps) obtained using DYNOTEARS to estimate causal relationships from each time-series dataset. We use lag order 1 and aggregate the estimated contemporaneous and lag-1 graphs into a single heatmap for visualization. Hyperparameters are tuned following pamfil2020dynotears. Bottom: the estimated graph produced by our method, SCD. We emphasize that DYNOTEARS is designed for stationary linear SVAR; the SDE-driven observations here can exhibit nonstationarity, leading to a model-mismatch regime where DYNOTEARS is not expected to perform optimally.
  • Figure 2: Results on DAGs (stable system): mean/std over 10 runs (SHD, TPR, FDR). The full table is provided in Tab.\ref{['tab:dag_stable']}.
  • Figure 3: Results on directed loop graphs (stable system): mean/std over 10 runs (SHD, TPR, FDR). The full table is provided in Tab.\ref{['tab:loop_stable']}.
  • Figure S1: Known ODE dynamics (black box), given by Hes1 system, provide mechanistic prior information, while learning the causal structure (green oval), including additional variables (red box), is the key task in our setup.
  • Figure S2: A visualization of dataset

Theorems & Definitions (24)

  • Definition 1: Causal Coupling Beyond Known Dynamics
  • Remark 1
  • Proposition 1
  • Remark 2
  • Theorem 2: Causal identifiability
  • Lemma 1
  • Lemma 2
  • Lemma 3
  • Lemma 4
  • Lemma 5: High-probability Hessian Lipschitz
  • ...and 14 more