Learning Directed Acyclic Graphs from Partial Orderings
Ali Shojaie, Wenyu Chen
TL;DR
The paper tackles learning DAG structure from partial causal orderings, focusing on a two-layer bipartite setting where $\mathcal{X}\prec\mathcal{Y}$. It introduces PODAG, a framework that combines an intersection of naive cross-layer edge candidates with a search restricted to conditional Markov blankets, enabling efficient and accurate recovery of cross-layer edges under layering-adjacency-faithfulness. It establishes theoretical guarantees for both low- and high-dimensional regimes, demonstrates improved faithfulness requirements over traditional PC-based methods, and shows practical gains in applications such as eQTL mapping. The work extends naturally to multiple layers and nonparametric models, offering a versatile approach for causal graph learning with partial order information and broad applicability in biology and genomics.
Abstract
Directed acyclic graphs (DAGs) are commonly used to model causal relationships among random variables. In general, learning the DAG structure is both computationally and statistically challenging. Moreover, without additional information, the direction of edges may not be estimable from observational data. In contrast, given a complete causal ordering of the variables, the problem can be solved efficiently, even in high dimensions. In this paper, we consider the intermediate problem of learning DAGs when a partial causal ordering of variables is available. We propose a general estimation framework for leveraging the partial ordering and present efficient estimation algorithms for low- and high-dimensional problems. The advantages of the proposed framework are illustrated via numerical studies.
