Probabilities of Causation for Continuous and Vector Variables
Yuta Kawakami, Manabu Kuroki, Jin Tian
TL;DR
This work extends probabilities of causation (PoC) to continuous and vector-valued treatments and outcomes, introducing nonparametric identification theorems under exogeneity and monotonicity-like assumptions. It develops scalar PoC definitions and their identifiability, then generalizes to multivariate outcomes using a lexicographic order and conditional CDFs, enabling PoC analysis with covariates. The authors further propose PoC variants with evidence and multi-hypothetical terms, provide corresponding identification results, and demonstrate the approach on a real-world education dataset. Overall, the framework broadens causal reasoning with continuous and high-dimensional effects, offering practical tools for decision-making and explainable AI.
Abstract
Probabilities of causation (PoC) are valuable concepts for explainable artificial intelligence and practical decision-making. PoC are originally defined for scalar binary variables. In this paper, we extend the concept of PoC to continuous treatment and outcome variables, and further generalize PoC to capture causal effects between multiple treatments and multiple outcomes. In addition, we consider PoC for a sub-population and PoC with multi-hypothetical terms to capture more sophisticated counterfactual information useful for decision-making. We provide a nonparametric identification theorem for each type of PoC we introduce. Finally, we illustrate the application of our results on a real-world dataset about education.
