Two-Phase Treatment with Noncompliance: Identifying the Cumulative Average Treatment Effect via Multisite Instrumental Variables
Guanglei Hong, Xu Qin, Zhengyan Xu, Fan Yang
TL;DR
This work tackles the problem of identifying the cumulative average treatment effect (ATE) of a two-phase intervention in multisite randomized trials when noncompliance arises after Phase I and posttreatment confounding may bias outcomes. The authors introduce MS2T-IV, a multisite two-phase treatment IV framework that uses Phase I randomization as the instrument, accommodates a posttreatment mediator $V$, and allows for non-additive effects, thereby relaxing the exclusion restriction and sequential ignorability assumptions. Identification hinges on within-site ignorability, within-site zero covariance, and between-site independence, yielding the cumulant δ_{ATE} = γ_1 + γ_2 + γ_3 + θ_V α_1; estimation proceeds via site-level ITT models (Stage 1) and a between-site regression (Stage 2), with covariance-adjusted and bootstrap BCa inference. Simulations show MS2T-IV reduces bias and improves mean-squared error relative to naive or IPTW approaches, though inference with proper bootstrap is sometimes conservative; the method applied to Project STAR estimates a positive two-year small-class effect on Grade 1 outcomes, illustrating practical utility for evaluating multisite two-phase programs. Overall, the paper advances causal inference for multi-phase interventions by delivering a robust, less assumption-reliant approach to estimate cumulative ATEs in the presence of noncompliance and posttreatment confounding.
Abstract
When evaluating a two-phase intervention, the cumulative average treatment effect (ATE) is often the primary causal estimand of interest. However, some individuals who do not respond well to the Phase I treatment may subsequently display noncompliant behaviors. At the same time, exposure to the Phase I treatment is expected to directly influence an individual's potential outcomes, thereby violating the exclusion restriction. Building on an instrumental variable (IV) strategy for multisite trials, we clarify the conditions under which the cumulative ATE of a two-phase treatment can be identified by employing the random assignment of the Phase I treatment as the instrument. Our strategy relaxes both the conventional exclusion restriction and sequential ignorability assumptions. We assess the performance of the new strategy through simulation studies. Additionally, we reanalyze data from the Tennessee class size study, in which students and teachers were randomly assigned to either small or regular class types in kindergarten (Phase I) with noncompliance emerging in Grade 1 (Phase II). Applying our new strategy, we estimate the cumulative ATE of receiving two consecutive years of instruction in a small versus regular class.
