Table of Contents
Fetching ...

Residual permutation test for regression coefficient testing

Kaiyue Wen, Tengyao Wang, Yuhao Wang

TL;DR

This work introduces the Residual Permutation Test (RPT) for testing a single regression coefficient in a fixed-design linear model $\boldsymbol{Y}=\boldsymbol{X}\beta+b\boldsymbol{Z}+\boldsymbol{\varepsilon}$ with $p$ allowed to grow proportionally with $n$, achieving finite-sample size control under exchangeable noise when $p<n/2$. RPT builds a permutation-based p-value by projecting residuals onto the intersection of subspaces associated with the original and permuted designs, and proves uniform validity of the p-value under the null without requiring Gaussianity. It further analyzes statistical power under heavy-tailed noise with finite $(1+t)$-th moments, showing detectability when $|b| \ge c n^{-t/(1+t)}$ (or $|b|=\omega(n^{-1/2})$ if $t=1$), and proves these rates are essentially minimax-optimal. Numerical experiments corroborate finite-sample validity and demonstrate competitive power across a range of noise tails, with a variant RPT$_{EM}$ offering improved performance in some heavy-tailed scenarios. The paper also discusses extensions to diverging permutation sets, broader alternatives, and nonlinear relations, providing a comprehensive theoretical and empirical framework for robust coefficient testing in high-dimensional linear models.

Abstract

We consider the problem of testing whether a single coefficient is equal to zero in linear models when the dimension of covariates $p$ can be up to a constant fraction of sample size $n$. In this regime, an important topic is to propose tests with finite-sample valid size control without requiring the noise to follow strong distributional assumptions. In this paper, we propose a new method, called residual permutation test (RPT), which is constructed by projecting the regression residuals onto the space orthogonal to the union of the column spaces of the original and permuted design matrices. RPT can be proved to achieve finite-population size validity under fixed design with just exchangeable noises, whenever $p < n / 2$. Moreover, RPT is shown to be asymptotically powerful for heavy tailed noises with bounded $(1+t)$-th order moment when the true coefficient is at least of order $n^{-t/(1+t)}$ for $t \in [0,1]$. We further proved that this signal size requirement is essentially rate-optimal in the minimax sense. Numerical studies confirm that RPT performs well in a wide range of simulation settings with normal and heavy-tailed noise distributions.

Residual permutation test for regression coefficient testing

TL;DR

This work introduces the Residual Permutation Test (RPT) for testing a single regression coefficient in a fixed-design linear model with allowed to grow proportionally with , achieving finite-sample size control under exchangeable noise when . RPT builds a permutation-based p-value by projecting residuals onto the intersection of subspaces associated with the original and permuted designs, and proves uniform validity of the p-value under the null without requiring Gaussianity. It further analyzes statistical power under heavy-tailed noise with finite -th moments, showing detectability when (or if ), and proves these rates are essentially minimax-optimal. Numerical experiments corroborate finite-sample validity and demonstrate competitive power across a range of noise tails, with a variant RPT offering improved performance in some heavy-tailed scenarios. The paper also discusses extensions to diverging permutation sets, broader alternatives, and nonlinear relations, providing a comprehensive theoretical and empirical framework for robust coefficient testing in high-dimensional linear models.

Abstract

We consider the problem of testing whether a single coefficient is equal to zero in linear models when the dimension of covariates can be up to a constant fraction of sample size . In this regime, an important topic is to propose tests with finite-sample valid size control without requiring the noise to follow strong distributional assumptions. In this paper, we propose a new method, called residual permutation test (RPT), which is constructed by projecting the regression residuals onto the space orthogonal to the union of the column spaces of the original and permuted design matrices. RPT can be proved to achieve finite-population size validity under fixed design with just exchangeable noises, whenever . Moreover, RPT is shown to be asymptotically powerful for heavy tailed noises with bounded -th order moment when the true coefficient is at least of order for . We further proved that this signal size requirement is essentially rate-optimal in the minimax sense. Numerical studies confirm that RPT performs well in a wide range of simulation settings with normal and heavy-tailed noise distributions.
Paper Structure (42 sections, 35 theorems, 232 equations, 5 figures, 4 tables, 2 algorithms)

This paper contains 42 sections, 35 theorems, 232 equations, 5 figures, 4 tables, 2 algorithms.

Key Result

Lemma 1

Suppose $\boldsymbol{Y}$ is generated under Eq:Model with $\beta\in\mathbb{R}^p, b = 0$. Suppose also that $\boldsymbol{\varepsilon}$ is a random vector that is almost surely not a zero vector, $(\boldsymbol{X}, \boldsymbol{Z})$ is either deterministic or independent from $\boldsymbol{\varepsilon}$.

Figures (5)

  • Figure 1: Histogram of p-values under the null for ANOVA test and naive residual permutation test from $100000$ Monte-Carol replicates. The first line are the histograms of the ANOVA test under different specifications. Specifically, (a) is the result with Gaussian design, $n = 300, p = 100$ and $\boldsymbol{\varepsilon}$ has independent $t_1$ components; (b) is the histogram with the same setting as in (a) except that we switch from Gaussian design to $t_1$ design; (c) is the histogram with Gaussian design, $n = 600, p = 100$ and $\boldsymbol{\varepsilon}$ has independent $t_1$ components. The second line are the histogram for naive test. (e)-(f) use the same simulation settings as (a)-(c).
  • Figure 2: Power (proportion of rejections) with nominal level $\alpha = 0.01$ (represented by the horizontal dashed line) over 10000 replicates for $b = 0$ or on a logarithmic grid between 0.01 and 2. Here $\boldsymbol{X}, \boldsymbol{\varepsilon}$ and $\boldsymbol{e}$ are generated according to various distribution types prescribed in the caption of each figure.
  • Figure 3: Power (proportion of rejections) with nominal level $\alpha = 0.05$ (represented by the horizontal dashed line) over 10000 replicates for $b = 0$ or on a logarithmic grid between 0.01 and 2. Here $\boldsymbol{X}, \boldsymbol{\varepsilon}$ and $\boldsymbol{e}$ are generated according to various distribution types prescribed in the caption of each figure.
  • Figure A1: Power (proportion of rejections) with nominal level $\alpha = 0.01$ (represented by the horizontal dashed line) over 10000 replicates for $b = 0$ or on a logarithmic grid between 0.01 and 2. Here $\boldsymbol{X}, \boldsymbol{\varepsilon}$ and $\boldsymbol{e}$ are generated according to various distribution types prescribed in the caption of each figure.
  • Figure A2: Power (proportion of rejections) with nominal level $\alpha = 0.01$ (represented by the horizontal dashed line) over 10000 replicates for $b = 0$ or on a logarithmic grid between 0.01 and 2. Here $\boldsymbol{X}$ has independent $\mathcal{N}(0,1)$ entries, $\boldsymbol{e}$ has independent $t_1$ entries, $\boldsymbol{\varepsilon}$ either has independent $t_1$ entries, independent of $\boldsymbol{e}$ (independent noise), or a mixture of $t_1$ and $2t_1$ entries dependent on the sign of $\boldsymbol{e}$ (dependent noise). The covariate $\boldsymbol{Z}$ is modelled either using \ref{['Eq:Model2']} (linear relation) or generated through the nonlinear model described at the end of Section \ref{['sec:addnum']}.

Theorems & Definitions (70)

  • Definition 1
  • Lemma 1
  • Lemma 2
  • Theorem 2
  • Lemma 3
  • Theorem 3
  • Theorem 4
  • Proposition 1
  • Theorem 5
  • Theorem 6
  • ...and 60 more