Hyperdimensional Vector Tsetlin Machines with Applications to Sequence Learning and Generation
Christian D. Blakely
TL;DR
This work introduces a hybrid HVTM architecture that combines Hyperdimensional Vector Computing with Tsetlin Machines to learn, classify, forecast, and generate sequences. By encoding sequences into high-dimensional vectors using interval embeddings and N-Grams, and by leveraging associative memory for rapid retrieval, the approach achieves competitive time-series classification on the UCR Archive while retaining a lightweight memory footprint. The authors also demonstrate forecasting and generation capabilities with a two-step HV-based pipeline and provide extensive numerical experiments on harmonic and stochastic sequences, highlighting the benefits of 5-Gram encoding. Overall, the HVTM framework offers a fast, interpretable, and memory-efficient alternative to deep learning sequence models, well-suited for online/embedded settings.
Abstract
We construct a two-layered model for learning and generating sequential data that is both computationally fast and competitive with vanilla Tsetlin machines, adding numerous advantages. Through the use of hyperdimensional vector computing (HVC) algebras and Tsetlin machine clause structures, we demonstrate that the combination of both inherits the generality of data encoding and decoding of HVC with the fast interpretable nature of Tsetlin machines to yield a powerful machine learning model. We apply the approach in two areas, namely in forecasting, generating new sequences, and classification. For the latter, we derive results for the entire UCR Time Series Archive and compare with the standard benchmarks to see how well the method competes in time series classification.
