Modeling and Discovering Direct Causes for Predictive Models
Yizuo Chen, Amit Bhatia
TL;DR
The paper tackles identifying which input features directly drive predictions from black-box models by embedding predictive models into causal graphs (ADMGs). It shows that the direct causes correspond to the parents of $Y$ in a predictive graph, and under canonicity or weak faithfulness these sources coincide with the Markov boundary $\mathrm{MB}(Y)$, enabling discovery from observational data. The authors provide sound and complete algorithms for identifying direct causes via Markov-boundary discovery and introduce an independence-rule optimization (I-decomposability) to accelerate discovery, with theoretical guarantees. Empirical results demonstrate substantial reductions in computation and independence tests, particularly when the predictive graph has many direct causes, underscoring practical benefits for explainability and efficient data collection in complex predictive systems.
Abstract
We introduce a causal modeling framework that captures the input-output behavior of predictive models (e.g., machine learning models). The framework enables us to identify features that directly cause the predictions, which has broad implications for data collection and model evaluation. We then present sound and complete algorithms for discovering direct causes (from data) under some assumptions. Furthermore, we propose a novel independence rule that can be integrated with the algorithms to accelerate the discovery process, as we demonstrate both theoretically and empirically.
