Disentangle Estimation of Causal Effects from Cross-Silo Data
Yuxuan Liu, Haozhao Wang, Shuang Wang, Zhiming He, Wenchao Xu, Jialiang Zhu, Fan Yang
TL;DR
The paper tackles cross-silo causal inference with heterogeneous feature spaces under privacy constraints. It introduces FedDCI, a disentangled architecture that uses shared and private branches and a global constraint to transfer causal information while keeping local data local. A KL-based encoder aligns latent representations across silos and a coordinated optimization strategy yields convergence guarantees under standard non-convex assumptions, with detailed analysis showing a sublinear convergence rate. Empirical results on semi-synthetic Twins and IHDP datasets demonstrate that FedDCI outperforms state-of-the-art baselines in non-IID cross-silo settings, highlighting its practical potential for private, multi-domain causal effect estimation.
Abstract
Estimating causal effects among different events is of great importance to critical fields such as drug development. Nevertheless, the data features associated with events may be distributed across various silos and remain private within respective parties, impeding direct information exchange between them. This, in turn, can result in biased estimations of local causal effects, which rely on the characteristics of only a subset of the covariates. To tackle this challenge, we introduce an innovative disentangle architecture designed to facilitate the seamless cross-silo transmission of model parameters, enriched with causal mechanisms, through a combination of shared and private branches. Besides, we introduce global constraints into the equation to effectively mitigate bias within the various missing domains, thereby elevating the accuracy of our causal effect estimation. Extensive experiments conducted on new semi-synthetic datasets show that our method outperforms state-of-the-art baselines.
