Text Rationalization for Robust Causal Effect Estimation
Lijinghua Zhang, Hengrui Cai
TL;DR
This work tackles the challenge of estimating causal effects from text data by addressing positivity violations that arise when high-dimensional textual confounders inflate the covariate space. It introduces Confounding-Aware Token Rationalization (CATR), which learns a sparse subset of tokens via a differentiable selector and HSIC-based residual-dependence penalties to preserve confounding information while improving overlap. The authors provide nonasymptotic error bounds for neural nuisance estimators and establish sqrt(n)-consistency and asymptotic normality for the AIPW estimator under the framework, and validate the approach on semi-synthetic data and a real MIMIC-III sepsis cohort, showing improved stability, overlap, and interpretability. The method supports multimodal extensions to include structured covariates and yields interpretable token selections that highlight clinically relevant confounders. Overall, CATR offers a robust and interpretable toolkit for text-based causal adjustment with practical impact in healthcare analytics and beyond.
Abstract
Recent advances in natural language processing have enabled the increasing use of text data in causal inference, particularly for adjusting confounding factors in treatment effect estimation. Although high-dimensional text can encode rich contextual information, it also poses unique challenges for causal identification and estimation. In particular, the positivity assumption, which requires sufficient treatment overlap across confounder values, is often violated at the observational level, when massive text is represented in feature spaces. Redundant or spurious textual features inflate dimensionality, producing extreme propensity scores, unstable weights, and inflated variance in effect estimates. We address these challenges with Confounding-Aware Token Rationalization (CATR), a framework that selects a sparse necessary subset of tokens using a residual-independence diagnostic designed to preserve confounding information sufficient for unconfoundedness. By discarding irrelevant texts while retaining key signals, CATR mitigates observational-level positivity violations and stabilizes downstream causal effect estimators. Experiments on synthetic data and a real-world study using the MIMIC-III database demonstrate that CATR yields more accurate, stable, and interpretable causal effect estimates than existing baselines.
