Distributional Treatment Effect Estimation across Heterogeneous Sites via Optimal Transport
Borna Bateni, Yubai Yuan, Qi Xu, Annie Qu
TL;DR
The paper addresses cross-site transportability of intervention effects by moving beyond average effects to full distributional treatment outcomes. It introduces a distributional causal inference framework that treats treatment and control as probability measures on joint feature-outcome spaces and models cross-site heterogeneity as a push-forward map $T$, learned via an optimal transport objective that aligns controls using a regularized fused Gromov-Wasserstein loss and a metric-learning step. It then transports the source treatment distribution to the target site to synthesize a target-treatment dataset, with rigorous guarantees of consistency and asymptotic convergence to the true target distribution. Empirical results on synthetic scenarios and a real PDX dataset demonstrate accurate reconstruction of the entire treatment-effect distribution and superiority over ATE-focused baselines, highlighting practical utility for cross-site causal inference.
Abstract
We propose a novel framework for synthesizing counterfactual treatment group data in a target site by integrating full treatment and control group data from a source site with control group data from the target. Departing from conventional average treatment effect estimation, our approach adopts a distributional causal inference perspective by modeling treatment and control as distinct probability measures on the source and target sites. We formalize the cross-site heterogeneity (effect modification) as a push-forward transformation that maps the joint feature-outcome distribution from the source to the target site. This transformation is learned by aligning the control group distributions between sites using an Optimal Transport-based procedure, and subsequently applied to the source treatment group to generate the synthetic target treatment distribution. Under general regularity conditions, we establish theoretical guarantees for the consistency and asymptotic convergence of the synthetic treatment group data to the true target distribution. Simulation studies across multiple data-generating scenarios and a real-world application to patient-derived xenograft data demonstrate that our framework robustly recovers the full distributional properties of treatment effects.
