Partial counterfactual identification and uplift modeling: theoretical results and real-world assessment
Théo Verhelst, Denis Mercier, Jeevan Shrestha, Gianluca Bontempi
TL;DR
The paper develops uplift-based bounds on the probability of counterfactual events under unconfoundedness, extending Fréchet bounds to the four counterfactual joint outcomes through S0 and S1. It introduces a point estimator for counterfactual probabilities that assumes Y0 and Y1 are conditionally independent given X, and provides a hierarchical Bayesian simulator for validation. In simulation, the uplift bounds are tighter than the Fréchet bounds and the point estimator is reasonably accurate. The approach is then applied to a real telecom churn dataset, highlighting practical business insights and limitations. The work offers a practical framework for partial counterfactual identification using uplift modeling, with potential for refinement via more informative features and observational data.
Abstract
Counterfactuals are central to human causal reasoning and to the scientific discovery process. The uplift, also called conditional average treatment effect, measures the causal effect of some action, or treatment, on the outcome of an individual. This paper discusses how bounds on the probability of counterfactual statements can be derived from uplift terms. First, we derive original bounds on the probability of counterfactuals and show that the tightness of these bounds depends on how much information the feature set carries about the uplift term. Then, we propose a point estimator based on the assumption of conditional independence between the counterfactual outcomes. The quality of the bounds and of the point estimator is assessed on synthetic data and on a large real-world customer data set provided by a telecom company, showing significant improvement over the state of the art.
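
To make the idea of bounding a counterfactual probability concrete, here is a minimal Python sketch for one of the four joint outcomes, the event Y0 = 1 and Y1 = 0. It is an illustration under stated assumptions, not the authors' implementation. We assume that S0(x) and S1(x) denote P(Y0 = 1 | X = x) and P(Y1 = 1 | X = x), and that per-individual estimates of both are already available (for example from an uplift model). The function name `counterfactual_summary` and the toy numbers are hypothetical. The sketch compares the classical Fréchet bounds computed from the marginal probabilities with the bounds obtained by averaging the conditional Fréchet bounds over individuals, and adds the point estimate implied by conditional independence of Y0 and Y1 given X.

```python
from statistics import fmean


def counterfactual_summary(s0, s1):
    """Bounds and point estimate for P(Y0 = 1, Y1 = 0).

    s0[i] and s1[i] are assumed to be estimates of P(Y0 = 1 | x_i) and
    P(Y1 = 1 | x_i) for individual i (hypothetical inputs for this sketch).
    """
    if not s0 or len(s0) != len(s1):
        raise ValueError("s0 and s1 must be non-empty and of equal length")

    # Marginal probabilities, obtained by averaging over individuals.
    m0, m1 = fmean(s0), fmean(s1)

    # Classical Fréchet bounds, which use only the two marginals.
    frechet = (max(0.0, m0 - m1), min(m0, 1.0 - m1))

    # Conditional Fréchet bounds for each individual, then averaged.
    lower = fmean([max(0.0, a - b) for a, b in zip(s0, s1)])
    upper = fmean([min(a, 1.0 - b) for a, b in zip(s0, s1)])

    # Point estimate if Y0 and Y1 are conditionally independent given X.
    point = fmean([a * (1.0 - b) for a, b in zip(s0, s1)])

    return {"frechet": frechet, "uplift": (lower, upper), "point": point}


# Toy, made-up probabilities for four individuals.
result = counterfactual_summary([0.9, 0.1, 0.5, 0.3], [0.2, 0.6, 0.5, 0.1])
print(result)
```

On this toy input the marginal Fréchet interval is roughly [0.10, 0.45], the averaged conditional interval is roughly [0.225, 0.425], and the conditional-independence estimate is about 0.32, which lies inside both. The averaged interval can never be wider than the marginal one, because the maximum is convex and the minimum is concave (Jensen's inequality). It becomes strictly narrower when S0(x) and S1(x) vary with x, which matches the abstract's remark that tightness depends on how informative the features are.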
