Counterfactual Identifiability via Dynamic Optimal Transport
Fabio De Sousa Ribeiro, Ainkaran Santhirasekaram, Ben Glocker
TL;DR
This work addresses the challenge of identifying counterfactuals for high-dimensional multivariate outcomes from observational data by marrying continuous-time flows with dynamic optimal transport. It shows that under Markovian SCMs and standard regularity, the dynamic OT flow yields a unique, monotone counterfactual transport map that preserves rank in multivariate settings, extending the Monge–Kantorovich framework via the Brenier gradient map and the Benamou–Brenier formulation. The authors extend identifiability results to non-Markovian regimes (IV, BC, FC) and demonstrate that a Markovian OT coupling can be learned in practice through flow matching, enabling counterfactual inference without paired data. Empirically, they validate the theory on a counterfactual ellipse dataset and a MIMIC Chest X-ray study, showing improved counterfactual soundness (composition, effectiveness, reversibility) over prior methods and highlighting practical considerations for OT scaling. Overall, the paper provides a rigorous foundation for high-dimensional counterfactual identifiability and a practical CF inference pipeline with strong implications for causal reasoning in imaging and beyond, while noting current scalability limitations of large-scale OT.
Abstract
We address the open question of counterfactual identification for high-dimensional multivariate outcomes from observational data. Pearl (2000) argues that counterfactuals must be identifiable (i.e., recoverable from the observed data distribution) to justify causal claims. A recent line of work on counterfactual inference shows promising results but lacks identification, undermining the causal validity of its estimates. To address this, we establish a foundation for multivariate counterfactual identification using continuous-time flows, including non-Markovian settings under standard criteria. We characterise the conditions under which flow matching yields a unique, monotone and rank-preserving counterfactual transport map with tools from dynamic optimal transport, ensuring consistent inference. Building on this, we validate the theory in controlled scenarios with counterfactual ground-truth and demonstrate improvements in axiomatic counterfactual soundness on real images.
