Restricted Block Permutation for Two-Sample Testing
Jungwoo Ho
TL;DR
<p>This work introduces a block-restricted one-swap permutation framework for two-sample testing that preserves exact finite-sample validity while concentrating permutation changes along high-contrast, block-restricted paths. By modeling the permutation trajectory as a bounded martingale difference sequence and applying Bernstein–Freedman concentration, the authors derive data-dependent tail bounds and show that increment variances scale as $O(h^2)$ for key statistics, a contraction relative to full relabeling. This variance contraction translates into substantially smaller permutation critical values and improved power for canonical statistics such as the difference in means and the unbiased $\widehat{MMD}^2$, with explicit formulas for the data-dependent power and critical values. The paper also provides practical design guidelines (block formation, complementary block–pair swaps, and choice of representative ratio $\rho$) and supports the theory with simulations demonstrating higher power while maintaining exact type-I error control.</p>
Abstract
We study a structured permutation scheme for two-sample testing that restricts permutations to single cross-swaps between block-selected representatives. Our analysis yields three main results. First, we provide an exact validity construction that applies to any fixed restricted permutation set. Second, for both the difference of sample means and the unbiased $\widehat{\mathrm{MMD}}^{2}$ estimator, we derive closed-form one-swap increment identities whose conditional variances scale as $O(h^{2})$, in contrast to the $Θ(h)$ increment variability under full relabeling. This increment-level variance contraction sharpens the Bernstein--Freedman variance proxy and leads to substantially smaller permutation critical values. Third, we obtain explicit, data-dependent expressions for the resulting critical values and statistical power. Together, these results show that block-restricted one-swap permutations can achieve strictly higher power than classical full permutation tests while maintaining exact finite-sample validity, without relying on pessimistic worst-case Lipschitz bounds.
