Multiply-Robust Causal Change Attribution
Victor Quintas-Martinez, Mohammad Taha Bahadori, Eduardo Santiago, Jeff Mu, Dominik Janzing, David Heckerman
TL;DR
This paper tackles causal change attribution: disentangling how multiple causal mechanisms contribute to differences in an outcome distribution across two samples. It introduces a multiply robust estimation framework that combines regression and re-weighting to identify counterfactual distributions under a fixed causal DAG, proving consistency and asymptotic normality under weak ML-learning conditions. The authors show how to estimate per-mechanism contributions via Shapley values or along causal paths, with theoretical guarantees that the attribution measures inherit the estimator’s large-sample properties. Empirically, the method demonstrates strong robustness in Monte Carlo simulations and yields interpretable insights in a gender wage-gap study, with practical implementation in the Python library DoWhy, enabling reliable causal attribution in applied settings.
Abstract
Comparing two samples of data, we observe a change in the distribution of an outcome variable. In the presence of multiple explanatory variables, how much of the change can be explained by each possible cause? We develop a new estimation strategy that, given a causal model, combines regression and re-weighting methods to quantify the contribution of each causal mechanism. Our proposed methodology is multiply robust, meaning that it still recovers the target parameter under partial misspecification. We prove that our estimator is consistent and asymptotically normal. Moreover, it can be incorporated into existing frameworks for causal attribution, such as Shapley values, which will inherit the consistency and large-sample distribution properties. Our method demonstrates excellent performance in Monte Carlo simulations, and we show its usefulness in an empirical application. Our method is implemented as part of the Python library DoWhy (arXiv:2011.04216, arXiv:2206.06821).
