STanHop: Sparse Tandem Hopfield Model for Memory-Enhanced Time Series Prediction
Dennis Wu, Jerry Yao-Chieh Hu, Weijian Li, Bo-Yu Chen, Han Liu
TL;DR
This work introduces STanHop-Net, a memory-augmented framework for multivariate time series forecasting built from Generalized Sparse Hopfield (GSH) layers that bridge memory retrieval with attention. It formalizes a generalized sparse Hopfield model, proving tighter retrieval-error bounds and exponential memory capacity, and provides practical GSH layers (GSH, GSHPooling, GSHLayer) for deep learning. STanHop-Net stacks tandem TimeGSH and SeriesGSH blocks, employs patching and coarse-graining for multi-resolution learning, and integrates external memory via Plug-and-Play and Tune-and-Play plugins to rapidly respond to sudden events. Empirical results on six real-world datasets show strong performance with and without external memory, including notable gains in memory-enabled scenarios, and the approach offers a flexible path to memory-augmented time-series foundation models with theoretical guarantees. The combination of sparse associative memory, multi-resolution structure, and task-tailored external memory yields faster convergence, robustness to noise, and practical benefits for real-time inference in dynamic environments.
Abstract
We present STanHop-Net (Sparse Tandem Hopfield Network) for multivariate time series prediction with memory-enhanced capabilities. At the heart of our approach is STanHop, a novel Hopfield-based neural network block, which sparsely learns and stores both temporal and cross-series representations in a data-dependent fashion. In essence, STanHop sequentially learn temporal representation and cross-series representation using two tandem sparse Hopfield layers. In addition, StanHop incorporates two additional external memory modules: a Plug-and-Play module and a Tune-and-Play module for train-less and task-aware memory-enhancements, respectively. They allow StanHop-Net to swiftly respond to certain sudden events. Methodologically, we construct the StanHop-Net by stacking STanHop blocks in a hierarchical fashion, enabling multi-resolution feature extraction with resolution-specific sparsity. Theoretically, we introduce a sparse extension of the modern Hopfield model (Generalized Sparse Modern Hopfield Model) and show that it endows a tighter memory retrieval error compared to the dense counterpart without sacrificing memory capacity. Empirically, we validate the efficacy of our framework on both synthetic and real-world settings.
