Sparse High-Dimensional Vector Autoregressive Bootstrap
Robert Adamek, Stephan Smeekes, Ines Wilms
TL;DR
This work develops a sparse high-dimensional VAR multiplier bootstrap to approximate the distribution of the max-mean statistic $Q=\max_{1\le j\le N}|\frac{1}{\sqrt{T}}\sum_{t=1}^T x_{j,t}|$ when $N$ may exceed $T$. It combines lasso-based VAR estimation with a multiplier bootstrap on residuals to generate bootstrap samples that preserve dependence, and proves bootstrap consistency under sub-Gaussian and finite-moment error assumptions via a high-dimensional CLT for linear processes. A key theoretical contribution is a Gaussian approximation for the maximum mean of linear processes, enabling asymptotically exact inference for high-dimensional means and related statistics. The simulations show generally good size and power, with performance varying across DGPs, particularly under high persistence; the framework is positioned as a foundation for more general high-dimensional time-series inference and potential bias-correction extensions.
Abstract
We introduce a high-dimensional multiplier bootstrap for time series data based on capturing dependence through a sparsely estimated vector autoregressive model. We prove its consistency for inference on high-dimensional means under two different moment assumptions on the errors, namely sub-gaussian moments and a finite number of absolute moments. In establishing these results, we derive a Gaussian approximation for the maximum mean of a linear process, which may be of independent interest.
