Shapley-PC: Constraint-based Causal Structure Learning with a Shapley Inspired Framework
Fabrizio Russo, Francesca Toni
TL;DR
Shapley-PC introduces a Shapley-value–based decision rule to orient v-structures in constraint-based CSL, integrated into PC-Stable. It provides theoretical guarantees of soundness, completeness, and asymptotic consistency, and demonstrates empirical gains over existing PC-based methods, especially on denser graphs. The approach aggregates evidence across multiple CI tests via Shapley Independence Values, reducing sensitivity to individual test errors and improving robustness to near-unfaithful distributions. Extensive simulations and pseudo-real data experiments show consistent improvements in collider identification and overall graph recovery, with manageable computational overhead. These results suggest a practical, theoretically grounded enhancement to constraint-based causal discovery that can be extended to related CSL frameworks and alternative CI metrics.
Abstract
Causal Structure Learning (CSL), also referred to as causal discovery, amounts to extracting causal relations among variables in data. CSL enables the estimation of causal effects from observational data alone, avoiding the need to perform real life experiments. Constraint-based CSL leverages conditional independence tests to perform causal discovery. We propose Shapley-PC, a novel method to improve constraint-based CSL algorithms by using Shapley values over the possible conditioning sets, to decide which variables are responsible for the observed conditional (in)dependences. We prove soundness, completeness and asymptotic consistency of Shapley-PC and run a simulation study showing that our proposed algorithm is superior to existing versions of PC.
