Sharp variance estimator and causal bootstrap in stratified randomized experiments
Haoyang Yu, Ke Zhu, Hanzhong Liu
TL;DR
This work advances inference in stratified randomized experiments by introducing a sharp, Consistent variance estimator and two design-based causal bootstrap methods. The rank-preserving imputation bootstrap offers second-order refinement over normal approximation in strata with sufficient sizes, while the constant-treatment-effect imputation extends to paired designs. Through theory (consistency, Edgeworth expansions) and extensive simulations plus real-data applications, the authors show improved finite-sample coverage and shorter confidence intervals, particularly under nonnormal or heavy-tailed outcomes. An accompanying R package, CausalBootstrap, enables practical implementation. These methods enhance reliable, model-lean causal inference in complex randomization schemes.
Abstract
Randomized experiments are the gold standard for estimating treatment effects, and randomization serves as a reasoned basis for inference. In widely used stratified randomized experiments, randomization-based finite-population asymptotic theory enables valid inference for the average treatment effect, relying on normal approximation and a Neyman-type conservative variance estimator. However, when the sample size is small or the outcomes are skewed, the Neyman-type variance estimator may become overly conservative, and the normal approximation can fail. To address these issues, we propose a sharp variance estimator and two causal bootstrap methods to more accurately approximate the sampling distribution of the weighted difference-in-means estimator in stratified randomized experiments. The first causal bootstrap procedure is based on rank-preserving imputation and we prove its second-order refinement over normal approximation. The second causal bootstrap procedure is based on constant-treatment-effect imputation and is further applicable in paired experiments. In contrast to traditional bootstrap methods, where randomness originates from hypothetical super-population sampling, our analysis for the proposed causal bootstrap is randomization-based, relying solely on the randomness of treatment assignment in randomized experiments. Numerical studies and two real data applications demonstrate advantages of our proposed methods in finite samples. The \texttt{R} package \texttt{CausalBootstrap} implementing our method is publicly available.
