Choosing DAG Models Using Markov and Minimal Edge Count in the Absence of Ground Truth
Joseph D. Ramsey, Bryan Andrews, Peter Spirtes
TL;DR
The paper tackles the challenge of validating causal DAG/CPDAG models from data without ground truth by introducing the Markov Checker, a nonparametric pointwise consistent test of the Markov condition, and CAFS, a cross-algorithm search that favors models with minimal edges among those passing the test. The core idea is to assess whether the p-values from conditional-independence tests implied by a candidate graph are uniformly distributed under the null, enabling rejection of models that do not align with the population distribution, and then select frugal models. Key contributions include (i) formalizing MC testing via uniformity of p-values, (ii) relaxing the sparsest representation criterion to frugality and enabling comparisons across diverse algorithms, (iii) a practical software implementation within the Tetrad suite with large-scale capability, and (iv) simulation and empirical demonstrations showing that near-ground-truth models can be identified without ground-truth labels. The approach offers a scalable, ground-truth-free tool for domain experts to prune and tune causal structure learning pipelines, particularly for dense graphs where traditional reliability guarantees may fail.
Abstract
We give a novel nonparametric pointwise consistent statistical test (the Markov Checker) of the Markov condition for directed acyclic graph (DAG) or completed partially directed acyclic graph (CPDAG) models given a dataset. We also introduce the Cross-Algorithm Frugality Search (CAFS) for rejecting DAG models that either do not pass the Markov Checker test or that are not edge minimal. Edge minimality has been used previously by Raskutti and Uhler as a nonparametric simplicity criterion, though CAFS readily generalizes to other simplicity conditions. Reference to the ground truth is not necessary for CAFS, so it is useful for finding causal structure learning algorithms and tuning parameter settings that output causal models that are approximately true from a given data set. We provide a software tool for this analysis that is suitable for even quite large or dense models, provided a suitably fast pointwise consistent test of conditional independence is available. In addition, we show in simulation that the CAFS procedure can pick approximately correct models without knowing the ground truth.
