Finely Stratified Rerandomization Designs

Max Cytrynbaum

Finely Stratified Rerandomization Designs

Max Cytrynbaum

TL;DR

The paper develops a theory for finely stratified rerandomization that combines tight pre-match grouping with within-group rerandomization to satisfy balance on nonlinear covariate features. It shows that such designs implement partially linear regression adjustment by design, yielding nonparametric control over stratification covariates and linear control over rerandomization covariates, and derives GMM-based asymptotics for both finite and superpopulation estimands. It introduces nonlinear rerandomization variants and proves their asymptotic equivalence to linear designs, then optimizes the acceptance region via a minimax criterion, potentially leveraging pilot data. The authors provide a framework for robust inference under stratified rerandomization, including variance bounds for finite-population parameters and ex-post adjustments that restore normality, with simulations and an Angrist 2013 application illustrating gains in estimating treatment-effect heterogeneity. Overall, the work expands the toolkit for causal inference under data-adaptive stratification, offering practical guidance on design choices and inference procedures that exploit both nonparametric balance and semiparametric efficiency.

Abstract

We study estimation and inference on causal parameters under finely stratified rerandomization designs, which use baseline covariates to match units into groups (e.g. matched pairs), then rerandomize within-group treatment assignments until a balance criterion is satisfied. We show that finely stratified rerandomization does partially linear regression adjustment by design, providing nonparametric control over the stratified covariates and linear control over the rerandomized covariates. We introduce several new forms of rerandomization, allowing for imbalance metrics based on nonlinear estimators, and proposing a minimax scheme that minimizes the computational cost of rerandomization subject to a bound on estimation error. While the asymptotic distribution of GMM estimators under stratified rerandomization is generically non-normal, we show how to restore asymptotic normality using ex-post linear adjustment tailored to the stratification. We derive new variance bounds that enable conservative inference on finite population causal parameters, and provide asymptotically exact inference on their superpopulation counterparts.

Finely Stratified Rerandomization Designs

TL;DR

Abstract

Paper Structure (15 sections, 14 theorems, 29 equations, 2 figures)

This paper contains 15 sections, 14 theorems, 29 equations, 2 figures.

Introduction
Related Literature
Framework and Designs
Asymptotics for GMM Estimation
Finite Population Estimand
Superpopulation Estimand
Equivalence with Partially Linear Adjustment
Nonlinear Rerandomization
GMM Rerandomization
Propensity Score Rerandomization
Optimizing Acceptance Regions
Minimax Rerandomization
Minimizing Computational Cost
Beliefs From Pilot Data
Restoring Normality

Key Result

Lemma 3.4

Suppose $D_{1:n}$ as in Definition defn:rerandomization and require Assumption assumption:linear-rerandomization, assumption-gmm. Then $\sqrt{n}(\widehat{\theta} - \theta_n) = \sqrt{n} E_n[H_i \Pi a(W_i, \theta_0)] + o_p(1)$.

Figures (2)

Figure 1: Propensity rerandomization (Definition \ref{['defn:propensity-rerandomization']}) with $p = 1/2$ for $Z \sim \mathop{\mathrm{Unif}}\nolimits[0, 1]$ and $X = B(Z)$ a B-spline basis. LHS: $D_{1:n}$ and estimated propensity with $\widehat{p}(Z) \ll 1/2$, for $Z \in [0.4, 0.9]$, showing imbalance. RHS: Accepted allocation $D_{1:n}$ with $\mathcal{J}_n \leq \epsilon$
Figure 2: Prior information $B$ and $A_0 = \epsilon B^{\circ}$ for Example \ref{['ex:ellipse']}.

Theorems & Definitions (34)

Definition 2.1: Stratified Rerandomization
Example 2.2: Pure Stratification
Example 2.3: Complete Randomization
Example 2.4: Mahalanobis Rerandomization
Definition 2.5: Causal Estimands
Remark 2.6: Finite Population
Example 2.7: ATE and SATE
Example 2.8: LATE Heterogeneity
Example 3.3: ATE and SATE
Lemma 3.4: Linearization
...and 24 more

Finely Stratified Rerandomization Designs

TL;DR

Abstract

Finely Stratified Rerandomization Designs

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (34)