Exogenous Matching: Learning Good Proposals for Tractable Counterfactual Estimation
Yikang Chen, Dehui Du, Lili Tian
TL;DR
This work tackles tractable estimation of counterfactual expressions in general SCM settings by introducing Exogenous Matching (EXOM), an importance-sampling framework that minimizes a common variance upper bound via learning a conditional proposal Q_{U|Y_*}. The core theoretical contribution is a bound on the variance of the estimator, guiding a learning objective that reframes counterfactual estimation as conditional density modeling, with a stochastic counterfactual process enabling reuse across multiple queries. The method incorporates counterfactual Markov boundaries as structural priors and integrates with identifiable proxy SCMs, achieving unbiased or low-bias estimation in practical scenarios and outperforming standard IS baselines. Empirically, EXOM demonstrates superior sampling efficiency (ESP) and lower failure rates (FR) across diverse SCMs and density estimators, and remains applicable to real-world problems through proxy SCMs like CausalNF and NCM, albeit with acknowledged limitations such as reliance on partially specified models and faithfulness assumptions.
Abstract
We propose an importance sampling method for tractable and efficient estimation of counterfactual expressions in general settings, named Exogenous Matching. By minimizing a common upper bound of counterfactual estimators, we transform the variance minimization problem into a conditional distribution learning problem, enabling its integration with existing conditional distribution modeling approaches. We validate the theoretical results through experiments under various types and settings of Structural Causal Models (SCMs) and demonstrate the outperformance on counterfactual estimation tasks compared to other existing importance sampling methods. We also explore the impact of injecting structural prior knowledge (counterfactual Markov boundaries) on the results. Finally, we apply this method to identifiable proxy SCMs and demonstrate the unbiasedness of the estimates, empirically illustrating the applicability of the method to practical scenarios.
