Smaller Confidence Intervals From IPW Estimators via Data-Dependent Coarsening
Alkis Kalavasis, Anay Mehrotra, Manolis Zampetakis
TL;DR
This work addresses the fragility of inverse propensity-score weighted estimators to inaccuracies in propensity scores and the presence of outliers in covariates. It introduces Coarse IPW (CIPW) estimators, defined via data-dependent partitions of the covariate space, and proves that under mild Lipschitz and sparsity conditions, a learning procedure can achieve an RMSE of order $O\left(\varepsilon + 1/\sqrt{n}\right)$ when propensity scores are perturbed by at most $\varepsilon$, and a robust RMSE of similar order in the presence of outliers. The authors establish both algorithmic guarantees and fundamental hardness results: Min-RMSE is NP-hard to optimize or approximate, and learning good-local partitions from finite data is information-theoretically hard without data dependence. They show asymptotic normality for CIPW estimators built on good-local partitions and provide explicit bias and variance bounds that justify the robustness of CIPW. Overall, CIPW offers provably robust, data-dependent confidence intervals for the average treatment effect in observational studies, outperforming traditional IPW and trimmed-IPW approaches and aligning with information-theoretic lower bounds in favorable regimes.
Abstract
Inverse propensity-score weighted (IPW) estimators are prevalent in causal inference for estimating average treatment effects in observational studies. Under unconfoundedness, given accurate propensity scores and $n$ samples, the size of confidence intervals of IPW estimators scales down with $n$, and, several of their variants improve the rate of scaling. However, neither IPW estimators nor their variants are robust to inaccuracies: even if a single covariate has an $\varepsilon>0$ additive error in the propensity score, the size of confidence intervals of these estimators can increase arbitrarily. Moreover, even without errors, the rate with which the confidence intervals of these estimators go to zero with $n$ can be arbitrarily slow in the presence of extreme propensity scores (those close to 0 or 1). We introduce a family of Coarse IPW (CIPW) estimators that captures existing IPW estimators and their variants. Each CIPW estimator is an IPW estimator on a coarsened covariate space, where certain covariates are merged. Under mild assumptions, e.g., Lipschitzness in expected outcomes and sparsity of extreme propensity scores, we give an efficient algorithm to find a robust estimator: given $\varepsilon$-inaccurate propensity scores and $n$ samples, its confidence interval size scales with $\varepsilon+1/\sqrt{n}$. In contrast, under the same assumptions, existing estimators' confidence interval sizes are $Ω(1)$ irrespective of $\varepsilon$ and $n$. Crucially, our estimator is data-dependent and we show that no data-independent CIPW estimator can be robust to inaccuracies.
