The Observational Partial Order of Causal Structures with Latent Variables
Marina Maciel Ansanelli, Elie Wolfe, Robert W. Spekkens
TL;DR
This work introduces the observational partial order on causal structures with latent variables, focusing on how observational distributions over visible variables distinguish competing structures. It provides a complete characterization for the partial order among 3-node mDAGs and a partial solution for 4-node cases, using a toolkit of dominance/nondominance rules (including HLP Edge-Adding and progressive Facet-Merging) and the analysis of inequality constraints. A key finding is that nonalgebraic (inequality-constraint) causal structures are increasingly ubiquitous as the number of visible variables grows, implying widespread potential quantum-classical gaps and signaling the inadequacy of CI-only methods for causal discovery in larger graphs. The results underscore the importance of inequality and nested Markov constraints for effective causal discovery and model selection, and they provide a foundation for identifying which observational equivalence classes are uniquely identifiable from data. The work also connects to quantum causality by highlighting when classical models can be decisively ruled out without fine-tuning, based on inequality constraints.
Abstract
For two causal structures with the same set of visible variables, one is said to observationally dominate the other if the set of distributions over the visible variables realizable by the first contains the set of distributions over the visible variables realizable by the second. Knowing such dominance relations is useful for adjudicating between these structures given observational data. We here consider the problem of determining the partial order of equivalence classes of causal structures with latent variables relative to observational dominance. We provide a complete characterization of the dominance order in the case of three visible variables, and a partial characterization in the case of four visible variables. Our techniques also help to identify which observational equivalence classes have a set of realizable distributions that is characterized by nontrivial inequality constraints, analogous to Bell inequalities and instrumental inequalities. We find evidence that as one increases the number of visible variables, the equivalence classes satisfying nontrivial inequality constraints become ubiquitous. (Because such classes are the ones for which there can be a difference in the distributions that are quantumly and classically realizable, this implies that the potential for quantum-classical gaps is also ubiquitous.) Furthermore, we find evidence that constraint-based causal discovery algorithms that rely solely on conditional independence constraints have a significantly weaker distinguishing power among observational equivalence classes than algorithms that go beyond these (i.e., algorithms that also leverage nested Markov constraints and inequality constraints).
