Stable Differentiable Causal Discovery
Achille Nazaret, Justin Hong, Elham Azizi, David Blei
TL;DR
The paper tackles learning causal graphs represented as directed acyclic graphs ($\mathrm{DAGs}$) from observational and interventional data. It identifies instability in existing differentiable causal discovery (DCD) methods and introduces Stable Differentiable Causal Discovery (SDCD) that uses a stable spectral acyclicity constraint $h_\rho(A)$ with a two-stage edge-pruning process. The authors prove stability and correctness, and demonstrate through extensive experiments that SDCD converges faster, improves accuracy, and scales to thousands of variables for both observational and interventional data, with code available at the provided repository. This approach enables reliable large-scale causal discovery in data-rich settings, expanding applicability of DCD to real-world problems.
Abstract
Inferring causal relationships as directed acyclic graphs (DAGs) is an important but challenging problem. Differentiable Causal Discovery (DCD) is a promising approach to this problem, framing the search as a continuous optimization. But existing DCD methods are numerically unstable, with poor performance beyond tens of variables. In this paper, we propose Stable Differentiable Causal Discovery (SDCD), a new method that improves previous DCD methods in two ways: (1) It employs an alternative constraint for acyclicity; this constraint is more stable, both theoretically and empirically, and fast to compute. (2) It uses a training procedure tailored for sparse causal graphs, which are common in real-world scenarios. We first derive SDCD and prove its stability and correctness. We then evaluate it with both observational and interventional data and on both small-scale and large-scale settings. We find that SDCD outperforms existing methods in both convergence speed and accuracy and can scale to thousands of variables. We provide code at https://github.com/azizilab/sdcd.
