Model-Agnostic Bounds for Augmented Inverse Probability Weighted Estimators' Wald-Confidence Interval Coverage in Randomized Controlled Trials
Hongxiang Qiu
TL;DR
This work analyzes Wald-confidence-interval coverage for augmented inverse probability weighted estimators in randomized trials, focusing on nonparametric nuisance estimators and a known propensity score. It derives non-asymptotic, model-agnostic Berry-Esseen-type bounds for CI coverage, with explicit dependence on cross-fitting, nuisance-estimator convergence, and function-class complexity. The study reveals that cross-fitting can improve CI coverage through reduced empirical-process error and potential overestimation of variance, while non-cross-fitting may lead to slower or biased coverage, especially under rich nuisance-function classes. Simulations corroborate that cross-fit AIPW methods, particularly CVSL, offer favorable variance behavior and coverage near the nominal level in moderate samples, providing practical guidance for causal inference in RCT settings with flexible nuisance estimation.
Abstract
Nonparametric estimators, such as the augmented inverse probability weighted (AIPW) estimator, have become increasingly popular in causal inference. Numerous nonparametric estimators have been proposed, but they are all asymptotically normal with the same asymptotic variance under similar conditions, leaving little guidance for practitioners to choose an estimator. In this paper, I focus on another important perspective of their asymptotic behaviors beyond asymptotic normality, the convergence of the Wald-confidence interval (CI) coverage to the nominal coverage. Such results have been established for simpler estimators (e.g., the Berry-Esseen Theorem), but are lacking for nonparametric estimators. I consider a simple but practical setting where the AIPW estimator based on a black-box nuisance estimator, with or without cross-fitting, is used to estimate the average treatment effect in randomized controlled trials. I derive non-asymptotic Berry-Esseen-type bounds on the difference between Wald-CI coverage and the nominal coverage. I also analyze the bias of variance estimators, showing that the cross-fit variance estimator might overestimate while the non-cross-fit variance estimator might underestimate, which might explain why cross-fitting has been empirically observed to improve Wald-CI coverage.
