Random Projections and Natural Sparsity in Time-Series Classification: A Theoretical Analysis
Jorge Marco-Blanco, Rubén Cuevas
TL;DR
The paper addresses why the Rocket time-series classifier—built from thousands of random convolutional kernels followed by the PPV nonlinearity and a ridge classifier—works so well without training. It grounds Rocket in compressed sensing, showing that the random Toeplitz transforms preserve discriminative information for sparse signals via RIP and coherence analyses, and that PPV acts as a principled sparsity measure enhanced by a bias term. It demonstrates two key properties—noise robustness and translation invariance—and links the observed strong empirical performance to a low-dimensional, diverse feature space created by many random bases. The work offers actionable insights for parameter configuration and interpretability, with empirical validation across 117 UCR datasets, and outlines directions for dynamic hyperparameter tuning and broader applications.
Abstract
Time-series classification is essential across diverse domains, including medical diagnosis, industrial monitoring, financial forecasting, and human activity recognition. The Rocket algorithm has emerged as a simple yet powerful method, achieving state-of-the-art performance through random convolutional kernels applied to time-series data, followed by non-linear transformation. Its architecture approximates a one-hidden-layer convolutional neural network while eliminating parameter training, ensuring computational efficiency. Despite its empirical success, fundamental questions about its theoretical foundations remain unexplored. We bridge theory and practice by formalizing Rocket's random convolutional filters within the compressed sensing framework, proving that random projections preserve discriminative patterns in time-series data. This analysis reveals relationships between kernel parameters and input signal characteristics, enabling more principled approaches to algorithm configuration. Moreover, we demonstrate that its non-linearity, based on the proportion of positive values after convolutions, expresses the inherent sparsity of time-series data. Our theoretical investigation also proves that Rocket satisfies two critical conditions: translation invariance and noise robustness. These findings enhance interpretability and provide guidance for parameter optimization in extreme cases, advancing both theoretical understanding and practical application of time-series classification.
