Clustered Switchback Designs for Experimentation Under Spatio-temporal Interference
Su Jia, Nathan Kallus, Christina Lee Yu
TL;DR
The paper addresses causal experimentation under simultaneous spatio-temporal interference and non-stationarity by proposing a clustered switchback design and a radius-$r$ truncated Horvitz-Thompson estimator. It proves an $\tilde{O}(1/(NT))$ MSE bound (up to a graph-dependent factor) for graphs with good clustering, unifying results for pure switchback and A/B testing under interference. The framework uses a Markovian state evolution with rapid mixing and derives explicit bias and variance bounds via a dependence graph, with results specialized to sparse, $\kappa$-restricted growth, and spatial graphs. Simulations across single- and multi-unit settings validate the theoretical rates and demonstrate practical advantages of clustering both in space and time.
Abstract
We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between average outcomes having exposed all units at all times to treatment or to control. We suppose spatial interference is described by a graph, where a unit's outcome depends on its neighborhood's treatments, and that temporal interference is described by an MDP, where the transition kernel under either treatment (action) satisfies a rapid mixing condition. We propose a clustered switchback design, where units are grouped into clusters and time steps are grouped into blocks, and each whole cluster-block combination is assigned a single random treatment. Under this design, we show that for graphs that admit good clustering, a truncated Horvitz-Thompson estimator achieves a $\tilde O(1/NT)$ mean squared error (MSE), matching the lower bound up to logarithmic terms for sparse graphs. Our results simultaneously generalize the results from \citet{hu2022switchback,ugander2013graph} and \citet{leung2022rate}. Simulation studies validate the favorable performance of our approach.
