Causal Temporal Regime Structure Learning
Abdellah Rahmani, Pascal Frossard
TL;DR
CASTOR tackles causal structure learning in multivariate time series that exhibit multiple, sequential regimes with regime-specific graphs. It casts the problem as learning a mixture over regimes and employs an EM algorithm to jointly infer the number of regimes, their segmentation, and the per-regime temporal DAGs, with identifiability guarantees up to permutation under Gaussian noise. The approach handles both instantaneous and time-lagged effects and supports linear and nonlinear relationships via parametric SEMs and neural networks, respectively. Empirical results on synthetic and real datasets show CASTOR outperforms existing methods in regime detection and DAG learning, confirming its practical value for non-stationary dynamic systems. The work provides a principled framework for regime-aware causal discovery with theoretical identifiability and scalable learning.
Abstract
Understanding causal relationships in multivariate time series is essential for predicting and controlling dynamic systems in fields like economics, neuroscience, and climate science. However, existing causal discovery methods often assume stationarity, limiting their effectiveness when time series consist of sequential regimes, consecutive temporal segments with unknown boundaries and changing causal structures. In this work, we firstly introduce a framework to describe and model such time series. Then, we present CASTOR, a novel method that concurrently learns the Directed Acyclic Graph (DAG) for each regime while determining the number of regimes and their sequential arrangement. CASTOR optimizes the data log-likelihood using an expectation-maximization algorithm, alternating between assigning regime indices (expectation step) and inferring causal relationships in each regime (maximization step). We establish the identifiability of the regimes and DAGs within our framework. Extensive experiments show that CASTOR consistently outperforms existing causal discovery models in detecting different regimes and learning their DAGs across various settings, including linear and nonlinear causal relationships, on both synthetic and real world datasets.
