On Statistical Inference for High-Dimensional Binary Time Series
Dehao Dai, Yunyi Zhang
TL;DR
This work tackles inference for high-dimensional binary time series by adopting a generalized binary VAR (gbVAR) framework with sparse coefficient matrices. It introduces a post-selection estimator that yields tractable, close-form estimates on selected supports and proves model-selection consistency, along with a Gaussian approximation for the estimator. To enable valid uncertainty quantification, the authors develop a second-order wild bootstrap that accounts for complex temporal dependence and random coefficients. Through Bernstein-type concentration results and extensive simulations plus real-data applications (portfolio management and global trade), the method demonstrates accurate inference and interpretable, sparsity-driven dynamic networks. Overall, the paper provides a coherent theoretical and empirical toolkit for statistical inference in high-dimensional binary time-series settings with practical relevance to networks and economics.
Abstract
The analysis of non-real-valued data, such as binary time series, has attracted great interest in recent years. This manuscript proposes a post-selection estimator for estimating the coefficient matrices of a high-dimensional generalized binary vector autoregressive process and establishes a Gaussian approximation theorem for the proposed estimator. Furthermore, it introduces a second-order wild bootstrap algorithm to enable statistical inference on the coefficient matrices. Numerical studies and empirical applications demonstrate the good finite-sample performance of the proposed method.
