Hierarchical Bias-Driven Stratification for Interpretable Causal Effect Estimation
Lucile Ter-Minassian, Liran Szlak, Ehud Karavani, Chris Holmes, Yishai Shimoni
TL;DR
This work tackles the need for transparent causal effect estimation from observational data by introducing BICauseTree, an interpretable balancing method that uses a decision-tree framework to discover local natural experiments and identify positivity violations. The method optimizes splits to maximize covariate imbalance reduction via Absolute Standardized Mean Difference, incorporates a principled pruning and positivity-filtering procedure, and allows leaf-wise estimation with flexible outcome/propensity models, yielding a covariate-based inferentiable target population. Across synthetic and real benchmark datasets, BICauseTree achieves competitive bias with interpretable partitions, effectively abstaining in regions with poor overlap and enabling policy-relevant, covariate-defined inferences. The approach combines transparency with practical performance and provides open-source code, facilitating adoption in high-stakes domains where trust and interpretability are crucial.
Abstract
Interpretability and transparency are essential for incorporating causal effect models from observational data into policy decision-making. They can provide trust for the model in the absence of ground truth labels to evaluate the accuracy of such models. To date, attempts at transparent causal effect estimation consist of applying post hoc explanation methods to black-box models, which are not interpretable. Here, we present BICauseTree: an interpretable balancing method that identifies clusters where natural experiments occur locally. Our approach builds on decision trees with a customized objective function to improve balancing and reduce treatment allocation bias. Consequently, it can additionally detect subgroups presenting positivity violations, exclude them, and provide a covariate-based definition of the target population we can infer from and generalize to. We evaluate the method's performance using synthetic and realistic datasets, explore its bias-interpretability tradeoff, and show that it is comparable with existing approaches.
