Causal Structure Learning in Directed, Possibly Cyclic, Graphical Models
Pardis Semnani, Elina Robeva
TL;DR
This work addresses causal discovery for directed graphs that may contain cycles by assuming a distribution that is Markov and faithful to the unknown graph $G^\star$ with no latent variables. It introduces a two-step hybrid approach: first, a greedy search over partially ordered partitions guided by a sparsity-based graphical score GS to identify the Markov equivalence class (MEC) of $G^\star$, and second, two SCCR algorithms to construct a graph in that MEC, with strongly connected components aligned to the partition. The key contributions include the (i) Richardson-style Markov-equivalence characterization for cyclic graphs, (ii) a concrete, lexicographically-ordered score GS whose minimizers uniquely determine the MEC, and (iii) two practical and theoretically-grounded SCCR methods (construct-and-correct and submodular-flow-based) to realize a graph in the MEC. Experiments on simulated data up to around $n=10$ demonstrate the approach's viability, suggesting a path toward reliable causal discovery in settings with cycles and without parametric assumptions. The framework lays groundwork for relaxing latent-variable assumptions and extending to more general structural equation models in future work.
Abstract
We consider the problem of learning a directed graph $G^\star$ from observational data. We assume that the distribution which gives rise to the samples is Markov and faithful to the graph $G^\star$ and that there are no unobserved variables. We do not rely on any further assumptions regarding the graph or the distribution of the variables. Particularly, we allow for directed cycles in $G^\star$ and work in the fully non-parametric setting. Given the set of conditional independence statements satisfied by the distribution, we aim to find a directed graph which satisfies the same $d$-separation statements as $G^\star$. We propose a hybrid approach consisting of two steps. We first find a partially ordered partition of the vertices of $G^\star$ by optimizing a certain score in a greedy fashion. We prove that any optimal partition uniquely characterizes the Markov equivalence class of $G^\star$. Given an optimal partition, we propose an algorithm for constructing a graph in the Markov equivalence class of $G^\star$ whose strongly connected components correspond to the elements of the partition, and which are partially ordered according to the partial order of the partition. Our algorithm comes in two versions -- one which is provably correct and another one which performs fast in practice.
