PITE: Multi-Prototype Alignment for Individual Treatment Effect Estimation
Fuyuan Cao, Jiaxuan Zhang, Xiaoli Li
TL;DR
PITE tackles the challenge of estimating individual treatment effects from observational data by preserving local subgroup structure through a novel multi-prototype framework. It introduces within-group prototype matching and cross-group prototype alignment, optimized with a composite loss that includes clustering, alignment, and diversity terms, plus a two-head predictor for potential outcomes. Across synthetic, IHDP, and Jobs datasets, PITE achieves superior ITE accuracy and robustness compared with 13 baselines, supported by ablation and uniformity analyses. This prototype-level approach reduces distribution shift while maintaining local structure, enabling more reliable counterfactual estimation with potential for broader applicability in personalized decision-making.
Abstract
Estimating Individual Treatment Effects (ITE) from observational data is challenging due to confounding bias. Most studies tackle this bias by balancing distributions globally, but ignore individual heterogeneity and fail to capture the local structure that represents the natural clustering among individuals, which ultimately compromises ITE estimation. While instance-level alignment methods consider heterogeneity, they similarly overlook the local structure information. To address these issues, we propose an end-to-end Multi-\textbf{P}rototype alignment method for \textbf{ITE} estimation (\textbf{PITE}). PITE effectively captures local structure within groups and enforces cross-group alignment, thereby achieving robust ITE estimation. Specifically, we first define prototypes as cluster centroids based on similar individuals under the same treatment. To identify local similarity and the distribution consistency, we perform instance-to-prototype matching to assign individuals to the nearest prototype within groups, and design a multi-prototype alignment strategy to encourage the matched prototypes to be close across treatment arms in the latent space. PITE not only reduces distribution shift through fine-grained, prototype-level alignment, but also preserves the local structures of treated and control groups, which provides meaningful constraints for ITE estimation. Extensive evaluations on benchmark datasets demonstrate that PITE outperforms 13 state-of-the-art methods, achieving more accurate and robust ITE estimation.
