An Asymmetric Independence Model for Causal Discovery on Path Spaces
Georg Manten, Cecilia Casolo, Søren Wengel Mogensen, Niki Kilbertus
TL;DR
This work introduces E-separation, an asymmetric, time-aware separation criterion for directed mixed graphs to enable causal discovery from path-space observations in stochastic differential equations, including cyclic dynamics and partial observability. It develops a lifted dependency graph framework, defines asymmetric graphoid properties, and proves a dynamic global Markov property linking CI on path space to E-separation. The paper characterizes Markov equivalence under E-separation, proves the existence of a greatest element within each equivalence class for fully observed graphs, extends these results to latent models via latent projection, and provides experimental validation including an algorithm for causal discovery on SDEs. These results advance causal inference in continuous-time dynamical systems by leveraging time directionality and providing provable, data-recoverable structure in the presence of cycles and hidden variables.
Abstract
We develop the theory linking 'E-separation' in directed mixed graphs (DMGs) with conditional independence relations among coordinate processes in stochastic differential equations (SDEs), where causal relationships are determined by "which variables enter the governing equation of which other variables". We prove a global Markov property for cyclic SDEs, which naturally extends to partially observed cyclic SDEs, because our asymmetric independence model is closed under marginalization. We then characterize the class of graphs that encode the same set of independence relations, yielding a result analogous to the seminal 'same skeleton and v-structures' result for directed acyclic graphs (DAGs). In the fully observed case, we show that each such equivalence class of graphs has a greatest element as a parsimonious representation and develop algorithms to identify this greatest element from data. We conjecture that a greatest element also exists under partial observations, which we verify computationally for graphs with up to four nodes.
