Proximity Matters: Local Proximity Preserved Balancing for Treatment Effect Estimation
Hao Wang, Zhichao Chen, Yuan Shen, Jiajun Fan, Zhaoran Liu, Degui Yang, Xinggao Liu, Haoxuan Li
TL;DR
This work tackles the challenge of estimating heterogeneous treatment effects from observational data by acknowledging that global distribution alignment often ignores local unit similarities. It introduces Proximity-aware Counterfactual Regression (PCR), which combines a Local Proximity Preservation Regularizer (LPR) with an Informative Subspace Projector (ISP) under a fused Gromov-Wasserstein OT framework to better balance treated and control groups while mitigating the curse of dimensionality. The approach yields improved CATE estimation as measured by PEHE, ATE, and ATT across semi-synthetic IHDP and ACIC benchmarks, with ablations validating the contributions of both LPR and ISP. The results suggest PCR’s practical impact for bias mitigation in causal inference tasks and point to future work on integrating normalizing flows and deploying the method in industrial settings such as recommender systems.
Abstract
Heterogeneous treatment effect (HTE) estimation from observational data poses significant challenges due to treatment selection bias. Existing methods address this bias by minimizing distribution discrepancies between treatment groups in latent space, focusing on global alignment. However, the fruitful aspect of local proximity, where similar units exhibit similar outcomes, is often overlooked. In this study, we propose Proximity-aware Counterfactual Regression (PCR) to exploit proximity for representation balancing within the HTE estimation context. Specifically, we introduce a local proximity preservation regularizer based on optimal transport to depict the local proximity in discrepancy calculation. Furthermore, to overcome the curse of dimensionality that renders the estimation of discrepancy ineffective, exacerbated by limited data availability for HTE estimation, we develop an informative subspace projector, which trades off minimal distance precision for improved sample complexity. Extensive experiments demonstrate that PCR accurately matches units across different treatment groups, effectively mitigates treatment selection bias, and significantly outperforms competitors. Code is available at https://anonymous.4open.science/status/ncr-B697.
