Meta-Dependence in Conditional Independence Testing
Bijan Mazaheri, Jiaqi Zhang, Caroline Uhler
TL;DR
This work tackles the problem of meta-dependence among conditional independence tests used in constraint-based causal discovery by introducing CIMD, an information-theoretic measure based on information projections. CIMD quantifies how projecting a distribution onto one CI constraint changes its proximity to another CI constraint, capturing shared information between CI tests and the resulting co-occurrence of independence judgments. The authors provide practical computation methods, including an MLE-based approach for general distributions and closed-form formulas for multivariate Gaussians, and validate the approach with synthetic and real-world datasets, showing alignment between CIMD and finite-sample CI co-occurrences (FS-CID). The results offer a principled way to adjust significance thresholds in causal discovery and point toward robust “confidence ensembles” of causal structures under distributional uncertainty, with broad applicability beyond Gaussian models.
Abstract
Constraint-based causal discovery algorithms utilize many statistical tests for conditional independence to uncover networks of causal dependencies. These approaches to causal discovery rely on an assumed correspondence between the graphical properties of a causal structure and the conditional independence properties of observed variables, known as the causal Markov condition and faithfulness. Finite data yields an empirical distribution that is "close" to the actual distribution. Across these many possible empirical distributions, the correspondence to the graphical properties can break down for different conditional independencies, and multiple violations can occur at the same time. We study this "meta-dependence" between conditional independence properties using the following geometric intuition: each conditional independence property constrains the space of possible joint distributions to a manifold. The "meta-dependence" between conditional independences is informed by the position of these manifolds relative to the true probability distribution. We provide a simple-to-compute measure of this meta-dependence using information projections and consolidate our findings empirically using both synthetic and real-world data.
