High-Dimensional Binary Variates: Maximum Likelihood Estimation with Nonstationary Covariates and Factors
Xinbing Kong, Bin Wu, Wuyi Ye
TL;DR
This paper develops a high-dimensional binary factor model with nonstationary covariates and factors, accommodating both nonstationary and cointegrated single indices. It establishes novel MLE-based asymptotics, including dual convergence rates for coefficient estimators under $I(1)$ dynamics and faster rates under cointegration, with factor estimators exhibiting time-dependent behavior in the nonstationary case and time-invariant behavior under cointegration. A rank-minimization approach selects the number of factors, with consistency guarantees; the limiting distributions differ markedly between the nonstationary and cointegrated regimes. The methodology is validated via Monte Carlo simulations and an empirical application to jump arrivals in financial markets, where extracted jump-arrival factors enhance asset-pricing performance beyond standard FF factors.
Abstract
This paper introduces a high-dimensional binary variate model that accommodates nonstationary covariates and factors, and studies their asymptotic theory. This framework encompasses scenarios where single indices are nonstationary or cointegrated. For nonstationary single indices, the maximum likelihood estimator (MLE) of the coefficients has dual convergence rates and is collectively consistent under the condition $T^{1/2}/N\to0$, as both the cross-sectional dimension $N$ and the time horizon $T$ approach infinity. The MLE of all nonstationary factors is consistent when $T^δ/N\to0$, where $δ$ depends on the link function. The limiting distributions of the factors depend on time $t$, governed by the convergence of the Hessian matrix to zero. In the case of cointegrated single indices, the MLEs of both factors and coefficients converge at a higher rate of $\min(\sqrt{N},\sqrt{T})$. A distinct feature compared to nonstationary single indices is that the dual rate of convergence of the coefficients increases from $(T^{1/4},T^{3/4})$ to $(T^{1/2},T)$. Moreover, the limiting distributions of the factors do not depend on $t$ in the cointegrated case. Monte Carlo simulations verify the accuracy of the estimates. In an empirical application, we analyze jump arrivals in financial markets using this model, extract jump arrival factors, and demonstrate their efficacy in large-cross-section asset pricing.
