Transferring Causal Effects using Proxies
Manuel Iglesias-Alonso, Felix Schur, Julius von Kügelgen, Jonas Peters
TL;DR
This work addresses estimating the causal effect of a treatment $X$ on an outcome $Y$ when an unobserved confounder $U$ induces domain shifts and only a proxy $W$ of $U$ is observed in the target domain. By leveraging data from multiple source domains and assuming sufficient proxy informativeness (rank/invertibility of $P(W|E,x)$), the target interventional distribution $q(y|do(x))$ becomes identifiable even with discrete or continuous $U$. The authors propose two estimators with consistency guarantees, one achieving asymptotic normality and valid confidence intervals, and validate them through simulations and a hotel-ranking application. The results demonstrate practical viability for transferring causal effects across domains using proxies, with theoretical identifiability conditions and robust inference procedures. This framework advances causal domain adaptation by enabling estimation of interventions in unseen settings where only proxy measurements are available.
Abstract
We consider the problem of estimating a causal effect in a multi-domain setting. The causal effect of interest is confounded by an unobserved confounder and can change between the different domains. We assume that we have access to a proxy of the hidden confounder and that all variables are discrete or categorical. We propose methodology to estimate the causal effect in the target domain, where we assume to observe only the proxy variable. Under these conditions, we prove identifiability (even when treatment and response variables are continuous). We introduce two estimation techniques, prove consistency, and derive confidence intervals. The theoretical results are supported by simulation studies and a real-world example studying the causal effect of website rankings on consumer choices.
