CausalDynamics: A large-scale benchmark for structural discovery of dynamical causal models
Benjamin Herdeanu, Juan Nathaniel, Carla Roesch, Jatan Buch, Gregor Ramien, Johannes Haux, Pierre Gentine
TL;DR
The paper tackles the lack of robust benchmarks for causal discovery in nonlinear dynamical systems where interventions are impractical. It introduces CausalDynamics, a tiered, extensible data-generation framework that yields thousands of ground-truth structural dynamical causal models from both ordinary/stochastic differential equations and pseudo-real climate models. Key contributions include the largest benchmark of over 14k graphs with ground-truth causal graphs, a novel Growing Network with Redirection graph generator, an integrated evaluation workflow, and baseline results showing DL methods lag behind simpler approaches in many settings. The framework enables robust development of causal-discovery methods capable of handling high-dimensional, nonlinear, time-lagged, and confounded dynamics across diverse domains, with practical impact for scientific inference under real-world constraints.
Abstract
Causal discovery for dynamical systems poses a major challenge in fields where active interventions are infeasible. Most methods used to investigate these systems and their associated benchmarks are tailored to deterministic, low-dimensional and weakly nonlinear time-series data. To address these limitations, we present CausalDynamics, a large-scale benchmark and extensible data generation framework to advance the structural discovery of dynamical causal models. Our benchmark consists of true causal graphs derived from thousands of both linearly and nonlinearly coupled ordinary and stochastic differential equations as well as two idealized climate models. We perform a comprehensive evaluation of state-of-the-art causal discovery algorithms for graph reconstruction on systems with noisy, confounded, and lagged dynamics. CausalDynamics consists of a plug-and-play, build-your-own coupling workflow that enables the construction of a hierarchy of physical systems. We anticipate that our framework will facilitate the development of robust causal discovery algorithms that are broadly applicable across domains while addressing their unique challenges. We provide a user-friendly implementation and documentation on https://kausable.github.io/CausalDynamics.
