Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting
Wan Tian, Zhongfeng Qin
TL;DR
This work tackles large-scale interval-valued time series forecasting by introducing a feature extraction framework that jointly automates segmentation, clustering, and representation learning. Core ideas include estimating a collection of block Toeplitz sparse precision matrices to capture cluster-specific conditional dependencies, and transforming subsequences into image data via Joint Recurrence Plot for transfer-learning-based feature extraction using WS-DAN. The approach is optimized through a majorization-minimization scheme that alternates between assigning subsequences to clusters (via Viterbi) and estimating the precision matrices (via ADMM with SCAD), with proven convergence. Empirical results on stock data demonstrate that the learned representations substantially improve forecasting accuracy over traditional methods and many deep-learning baselines. The methodology offers a scalable pathway to leverage invariant structure in ITS for enhanced predictive performance in finance and related domains.
Abstract
Modeling and forecasting interval-valued time series (ITS) have attracted considerable attention due to their growing presence in various contexts. To the best of our knowledge, there have been no efforts to model large-scale ITS. In this paper, we propose a feature extraction procedure for large-scale ITS, which involves key steps such as auto-segmentation and clustering, and feature transfer learning. This procedure can be seamlessly integrated with any suitable prediction models for forecasting purposes. Specifically, we transform the automatic segmentation and clustering of ITS into the estimation of Toeplitz sparse precision matrices and assignment set. The majorization-minimization algorithm is employed to convert this highly non-convex optimization problem into two subproblems. We derive efficient dynamic programming and alternating direction method to solve these two subproblems alternately and establish their convergence properties. By employing the Joint Recurrence Plot (JRP) to image subsequence and assigning a class label to each cluster, an image dataset is constructed. Then, an appropriate neural network is chosen to train on this image dataset and used to extract features for the next step of forecasting. Real data applications demonstrate that the proposed method can effectively obtain invariant representations of the raw data and enhance forecasting performance.
