Finite-Sample Identification of Linear Regression Models with Residual-Permuted Sums
Szabolcs Szentpéteri, Balázs Csanád Csáji
TL;DR
This work introduces Residual-Permuted Sums (RPS), a distribution-free, finite-sample method for constructing confidence regions in linear regression by permuting residuals rather than perturbing signs. Under mild, general assumptions including i.i.d. noises $W_t$ with finite fourth moments, RPS achieves exact finite-sample coverage with a provable uniform strong consistency, and it extends the existing Sign-Perturbed Sums (SPS) ellipsoidal outer-approximation (EOA) to the permutation-based setting. A permutation-lemma-based proof establishes a strong law for randomly permuted sequences, which underpins the uniform convergence of RPS regions to the true parameter $\theta^*$. Simulation results demonstrate that RPS regions can be tighter than SPS and comparable to asymptotic regions while preserving finite-sample guarantees, highlighting its practical value for robust system identification and related applications.
Abstract
This letter studies a distribution-free, finite-sample data perturbation (DP) method, the Residual-Permuted Sums (RPS), which is an alternative of the Sign-Perturbed Sums (SPS) algorithm, to construct confidence regions. While SPS assumes independent (but potentially time-varying) noise terms which are symmetric about zero, RPS gets rid of the symmetricity assumption, but assumes i.i.d. noises. The main idea is that RPS permutes the residuals instead of perturbing their signs. This letter introduces RPS in a flexible way, which allows various design-choices. RPS has exact finite sample coverage probabilities and we provide the first proof that these permutation-based confidence regions are uniformly strongly consistent under general assumptions. This means that the RPS regions almost surely shrink around the true parameters as the sample size increases. The ellipsoidal outer-approximation (EOA) of SPS is also extended to RPS, and the effectiveness of RPS is validated by numerical experiments, as well.
