Estimation of High-dimensional Nonlinear Vector Autoregressive Models
Yuefeng Han, Likai Chen, Wei Biao Wu
TL;DR
The paper addresses high-dimensional time series with nonlinear dynamics by proposing a sparse additive nonlinear VAR framework built from basis expansions, enabling flexible modeling beyond linear VARs. It develops sharp Bernstein-type tail inequalities for nonlinear VAR processes with non-Gaussian innovations, providing the probabilistic foundation for subsequent high-dimensional analysis. The authors establish convergence rates and model-selection consistency for an $\ell_1$-regularized estimator based on a truncated basis expansion, with explicit bias-variance trade-offs tied to sparsity, smoothness ($\beta$), and truncation level ($L$). A scalable block-coordinate-descent algorithm is proposed, and the framework is validated through simulations and a real gene-regulatory-network time series, demonstrating improved performance over linear VAR in capturing nonlinear interactions. Overall, the work extends VAR theory to nonlinear, non-Gaussian, high-dimensional settings and offers practical tools for inference in complex time-series systems such as genomics and neuroscience.
Abstract
High-dimensional vector autoregressive (VAR) models have numerous applications in fields such as econometrics, biology, climatology, among others. While prior research has mainly focused on linear VAR models, these approaches can be restrictive in practice. To address this, we introduce a high-dimensional non-parametric sparse additive model, providing a more flexible framework. Our method employs basis expansions to construct high-dimensional nonlinear VAR models. We derive convergence rates and model selection consistency for least squared estimators, considering dependence measures of the processes, error moment conditions, sparsity, and basis expansions. Our theory significantly extends prior linear VAR models by incorporating both non-Gaussianity and non-linearity. As a key contribution, we derive sharp Bernstein-type inequalities for tail probabilities in both non-sub-Gaussian linear and nonlinear VAR processes, which match the classical Bernstein inequality for independent random variables. Additionally, we present numerical experiments that support our theoretical findings and demonstrate the advantages of the nonlinear VAR model for a gene expression time series dataset.
