Zero-shot causal learning
Hamed Nilforoshan, Michael Moor, Yusuf Roohani, Yining Chen, Anja Šurina, Michihiro Yasunaga, Sara Oblak, Jure Leskovec
TL;DR
CaML introduces zero-shot causal learning by framing intervention-specific CATE estimation as meta-learning tasks and synthesizing natural experiments to produce pseudo-outcomes. A single meta-model fuses intervention attributes ($W$) with individual covariates ($X$) to predict personalized effects $\tau_w(x)$ for unseen interventions, supported by a theoretical generalization bound and a Reptile-inspired training regime. Empirical results on large-scale medical claims and LINCS cell-line data show CaML surpasses strong zero-shot baselines and matches or exceeds baselines trained on test interventions, highlighting its ability to generalize across thousands of interventions and even to unseen drug combinations. This framework enables principled, scalable prediction of novel intervention effects, with significant implications for personalized medicine, policy design, and drug discovery.
Abstract
Predicting how different interventions will causally affect a specific individual is important in a variety of domains such as personalized medicine, public policy, and online marketing. There are a large number of methods to predict the effect of an existing intervention based on historical data from individuals who received it. However, in many settings it is important to predict the effects of novel interventions (e.g., a newly invented drug), which these methods do not address. Here, we consider zero-shot causal learning: predicting the personalized effects of a novel intervention. We propose CaML, a causal meta-learning framework which formulates the personalized prediction of each intervention's effect as a task. CaML trains a single meta-model across thousands of tasks, each constructed by sampling an intervention, its recipients, and its nonrecipients. By leveraging both intervention information (e.g., a drug's attributes) and individual features~(e.g., a patient's history), CaML is able to predict the personalized effects of novel interventions that do not exist at the time of training. Experimental results on real world datasets in large-scale medical claims and cell-line perturbations demonstrate the effectiveness of our approach. Most strikingly, \method's zero-shot predictions outperform even strong baselines trained directly on data from the test interventions.
