Causal Partial Identification via Conditional Optimal Transport
Sirui Lin, Zijun Gao, Jose Blanchet, Peter Glynn
TL;DR
The paper tackles partial identification of causal estimands that depend on the joint distribution of potential outcomes by introducing covariate-assisted sets via Conditional Optimal Transport (COT). It proves continuity of the COT functional under the adapted Wasserstein distance and proposes a direct, nonparametric adapted COT value estimator that avoids nuisance-function estimation, with provable consistency and finite-sample rates. Through simulations and a STAR data-inspired study, the method demonstrates robust, competitive performance against existing bounds-based approaches, particularly when covariates play a strong role. The work also discusses extensions to covariate-shift scenarios, triangular transport maps, and multi-treatment settings, highlighting practical implications for covariate-informed causal inference and distributional bounding.
Abstract
We study the estimation of causal estimand involving the joint distribution of treatment and control outcomes for a single unit. In typical causal inference settings, it is impossible to observe both outcomes simultaneously, which places our estimation within the domain of partial identification (PI). Pre-treatment covariates can substantially reduce estimation uncertainty by shrinking the partially identified set. Recent work has shown that covariate-assisted PI sets can be characterized through conditional optimal transport (COT) problems. However, finite-sample estimation of COT poses significant challenges, primarily because the COT functional is discontinuous under the weak topology, rendering the direct plug-in estimator inconsistent. To address this issue, existing literature relies on relaxations or indirect methods involving the estimation of non-parametric nuisance statistics. In this work, we demonstrate the continuity of the COT functional under a stronger topology induced by the adapted Wasserstein distance. Leveraging this result, we propose a direct, consistent, non-parametric estimator for COT value that avoids nuisance parameter estimation. We derive the convergence rate for our estimator and validate its effectiveness through comprehensive simulations, demonstrating its improved performance compared to existing approaches.
