Measures for Assessing Causal Effect Heterogeneity Unexplained by Covariates
Yuta Kawakami, Jin Tian
TL;DR
The paper addresses causal effect heterogeneity that remains after conditioning on covariates by introducing P-CACE and N-CACE for binary X with continuous Y, and P-CPICE and N-CPICE for continuous X with continuous Y under stochastic interventions. It develops principled identifications and sharp bounds for these measures within an SCM framework, and shows how CACE decomposes into positive and negative components, with a binary outcome recovery to THR/TBR. Through both theoretical results and a real-world medical dataset, the authors demonstrate that substantial heterogeneity can exist even when CACE is positive, highlighting the value of examining positive/negative subpopulations and stochastic-intervention effects for personalized interventions. The work thus broadens the causal-heterogeneity toolbox, offering identification, bounding, estimation guidance, and practical illustrations for nuanced policy decisions. Overall, the measures enable a more granular understanding of who benefits or suffers from an intervention, guiding targeted and tempered implementation in practice.
Abstract
There has been considerable interest in estimating heterogeneous causal effects across individuals or subpopulations. Researchers often assess causal effect heterogeneity based on the subjects' covariates using the conditional average causal effect (CACE). However, substantial heterogeneity may persist even after accounting for the covariates. Existing work on causal effect heterogeneity unexplained by covariates mainly focused on binary treatment and outcome. In this paper, we introduce novel heterogeneity measures, P-CACE and N-CACE, for binary treatment and continuous outcome that represent CACE over the positively and negatively affected subjects, respectively. We also introduce new heterogeneity measures, P-CPICE and N-CPICE, for continuous treatment and continuous outcome by leveraging stochastic interventions, expanding causal questions that researchers can answer. We establish identification and bounding theorems for these new measures. Finally, we show their application to a real-world dataset.
