Efficient Sampling for Realized Variance Estimation in Time-Changed Diffusion Models
Timo Dimitriadis, Roxana Halbleib, Jeannine Polivka, Jasper Rennspies, Sina Streicher, Axel Friedrich Wolter
TL;DR
The paper tackles efficient realized variance estimation by sampling intraday returns in intrinsic time within a tick-time stochastic volatility (TTSV) framework that separates trading intensity and tick variance. It derives finite-sample MSE decompositions showing that homogenizing intraday returns (HTS) minimizes MSE in noise-free environments, while a novel realized BTS (rBTS) scheme, which blends observed ticks with estimated tick variance, dominates when price observations are contaminated by market microstructure noise. The TTSV model provides a transparent IV decomposition, $IV(0,T)=\int_0^T \lambda(r)\varsigma^2(r)\,dr$, enabling precise comparisons across sampling schemes and linking efficiency to the information available. Extensive simulations and an empirical NYSE application corroborate the theoretical results, highlighting practical gains in estimation accuracy and forecast performance, and offering a robust alternative when MMN is present.
Abstract
This paper analyzes the benefits of sampling intraday returns in intrinsic time for the realized variance (RV) estimator. We theoretically show in finite samples that depending on the permitted sampling information, the RV estimator is most efficient under either hitting time sampling that samples whenever the price changes by a pre-determined threshold, or under the new concept of realized business time that samples according to a combination of observed trades and estimated tick variance. The analysis builds on the assumption that asset prices follow a diffusion that is time-changed with a jump process that separately models the transaction times. This provides a flexible model that allows for leverage specifications and Hawkes-type jump processes and separately captures the empirically varying trading intensity and tick variance processes, which are particularly relevant for disentangling the driving forces of the sampling schemes. Extensive simulations confirm our theoretical results and show that for low levels of noise, hitting time sampling remains superior while for increasing noise levels, realized business time becomes the empirically most efficient sampling scheme. An application to stock data provides empirical evidence for the benefits of using these intrinsic sampling schemes to construct more efficient RV estimators as well as for an improved forecast performance.
