High-dimensional estimation of quadratic variation based on penalized realized variance
Kim Christensen, Mikkel Slot Nielsen, Mark Podolskij
TL;DR
The paper tackles high-dimensional estimation of quadratic variation for continuous Itô semimartingales by introducing the penalized realized variance (PRV), which imposes a low-rank structure via nuclear-norm regularization and is computed through soft-thresholding the RV eigenvalues. It develops a complete non-asymptotic theory, establishing sharp bounds on estimation error and rank, and proves minimax optimality up to logarithmic factors. A data-driven tuning parameter via subsampling is proposed to facilitate practical implementation, along with a theoretical treatment of local variance estimation. Empirical and simulation results indicate that the PRV effectively identifies a small number of driving factors (typically 1–3) in high dimensions and exhibits rank compression during financial distress, consistent with factor-based asset pricing models.
Abstract
In this paper, we develop a penalized realized variance (PRV) estimator of the quadratic variation (QV) of a high-dimensional continuous Itô semimartingale. We adapt the principle idea of regularization from linear regression to covariance estimation in a continuous-time high-frequency setting. We show that under a nuclear norm penalization, the PRV is computed by soft-thresholding the eigenvalues of realized variance (RV). It therefore encourages sparsity of singular values or, equivalently, low rank of the solution. We prove our estimator is minimax optimal up to a logarithmic factor. We derive a concentration inequality, which reveals that the rank of PRV is -- with a high probability -- the number of non-negligible eigenvalues of the QV. Moreover, we also provide the associated non-asymptotic analysis for the spot variance. We suggest an intuitive data-driven subsampling procedure to select the shrinkage parameter. Our theory is supplemented by a simulation study and an empirical application. The PRV detects about three-five factors in the equity market, with a notable rank decrease during times of distress in financial markets. This is consistent with most standard asset pricing models, where a limited amount of systematic factors driving the cross-section of stock returns are perturbed by idiosyncratic errors, rendering the QV -- and also RV -- of full rank.
