Meta-Learners for Partially-Identified Treatment Effects Across Multiple Environments
Jonas Schweisthal, Dennis Frauen, Mihaela van der Schaar, Stefan Feuerriegel
TL;DR
The paper tackles partial identification of conditional average treatment effects (CATE) when data come from multiple environments and standard assumptions (overlap, unconfoundedness) may fail. By treating the environment as an instrumental variable, it derives environment-aware bounds for the CATE and proposes model-agnostic meta-learners to estimate these bounds, including naïve plug-in and two-stage WB/CB learners with theoretical guarantees such as consistency and double robustness. Empirical results on synthetic and real-world data show that the proposed bounds are valid and can be tightened by cross-environment information, with cross-environment, doubly robust learners often performing best in complex settings. The methods generalize to IV settings like randomized trials with non-compliance and offer a practical pathway for robust causal inference in heterogeneous environments, while highlighting careful interpretation under IV assumptions.
Abstract
Estimating the conditional average treatment effect (CATE) from observational data is relevant for many applications such as personalized medicine. Here, we focus on the widespread setting where the observational data come from multiple environments, such as different hospitals, physicians, or countries. Furthermore, we allow for violations of standard causal assumptions, namely, overlap within the environments and unconfoundedness. To this end, we move away from point identification and focus on partial identification. Specifically, we show that current assumptions from the literature on multiple environments allow us to interpret the environment as an instrumental variable (IV). This allows us to adapt bounds from the IV literature for partial identification of CATE by leveraging treatment assignment mechanisms across environments. Then, we propose different model-agnostic learners (so-called meta-learners) to estimate the bounds that can be used in combination with arbitrary machine learning models. We further demonstrate the effectiveness of our meta-learners across various experiments using both simulated and real-world data. Finally, we discuss the applicability of our meta-learners to partial identification in instrumental variable settings, such as randomized controlled trials with non-compliance.
