Quantifying Aleatoric Uncertainty of the Treatment Effect: A Novel Orthogonal Learner
Valentyn Melnychuk, Stefan Feuerriegel, Mihaela van der Schaar
TL;DR
The paper tackles quantifying aleatoric uncertainty in the treatment effect by focusing on the covariate-conditioned distribution of the treatment effect (CDTE), which is not point identifiable. It develops a novel AU-learner that performs Neyman-orthogonal, bias-corrected estimation of the Makarov bounds on the CDTE’s CDF/quantiles, with theoretical guarantees of quasi-oracle efficiency. A neural instantiation (AU-CNFs) provides a practical, scalable implementation that jointly estimates nuisance functions and bounds via conditional normalizing flows. Experiments on synthetic and semi-synthetic benchmarks demonstrate improved accuracy in estimating CDTE bounds, and a real-case lockdown analysis illustrates potential for informing personalized medical and policy decisions.
Abstract
Estimating causal quantities from observational data is crucial for understanding the safety and effectiveness of medical treatments. However, to make reliable inferences, medical practitioners require not only estimating averaged causal quantities, such as the conditional average treatment effect, but also understanding the randomness of the treatment effect as a random variable. This randomness is referred to as aleatoric uncertainty and is necessary for understanding the probability of benefit from treatment or quantiles of the treatment effect. Yet, the aleatoric uncertainty of the treatment effect has received surprisingly little attention in the causal machine learning community. To fill this gap, we aim to quantify the aleatoric uncertainty of the treatment effect at the covariate-conditional level, namely, the conditional distribution of the treatment effect (CDTE). Unlike average causal quantities, the CDTE is not point identifiable without strong additional assumptions. As a remedy, we employ partial identification to obtain sharp bounds on the CDTE and thereby quantify the aleatoric uncertainty of the treatment effect. We then develop a novel, orthogonal learner for the bounds on the CDTE, which we call AU-learner. We further show that our AU-learner has several strengths in that it satisfies Neyman-orthogonality and, thus, quasi-oracle efficiency. Finally, we propose a fully-parametric deep learning instantiation of our AU-learner.
