On Different Notions of Redundancy in Conditional-Independence-Based Discovery of Graphical Models
Philipp M. Faller, Dominik Janzing
TL;DR
This paper investigates how conditional-independence-based graph discovery can leverage redundant CI-statements to detect and correct errors. It distinguishes purely graphical redundancy—statements that follow from graphical structure but not from probabilistic laws—from Graphoid-redundancy, which follows from semi-Graphoid axioms and holds for all distributions, thereby offering limited error-detection value. By analyzing spanning trees and DAGs, the authors derive criteria and sufficient conditions under which purely graphical redundancy can reveal mis-specifications and even correct certain errors, while warning that some redundant statements may mislead if applied indiscriminately. Empirical results show that purely graphically redundant CI-tests can indicate more errors than Graphoid-redundant tests and can improve certain error-correction strategies (e.g., Minimum Markov-Distance for trees), though corrections in DAGs are more constrained and require additional assumptions. Overall, the work advances a framework for evaluating and enhancing CI-based graph discovery by accounting for the intertwined roles of graphical and probabilistic constraints, with implications for robust causal structure learning and model validation.
Abstract
Conditional-independence-based discovery uses statistical tests to identify a graphical model that represents the independence structure of variables in a dataset. These tests, however, can be unreliable, and algorithms are sensitive to errors and violated assumptions. Often, there are tests that were not used in the construction of the graph. In this work, we show that these redundant tests have the potential to detect or sometimes correct errors in the learned model. But we further show that not all tests contain this additional information and that such redundant tests have to be applied with care. Precisely, we argue that the conditional (in)dependence statements that hold for every probability distribution are unlikely to detect and correct errors - in contrast to those that follow only from graphical assumptions.
