Table of Contents
Fetching ...

TimeCapsule: Solving the Jigsaw Puzzle of Long-Term Time Series Forecasting with Compressed Predictive Representations

Yihang Lu, Yangyang Xu, Qitao Qing, Xianwei Meng

TL;DR

TimeCapsule tackles long-term forecasting for multivariate time series by treating data as a 3D tensor (temporal, variate, level) and applying high-dimensional information compression to unify diverse modeling principles. The approach combines mode-specific multi-head self-attention (MoMSA) with low-rank compression, residual information back, and JEPA-based internal prediction to learn compact, predictive representations that can map back to the temporal domain via a lightweight decoder. Extensive experiments across ten datasets show competitive or state-of-the-art performance, with notable improvements on very long horizons and insights into when multi-level modeling is most beneficial. The work demonstrates the practicality of a simple, compression-centric design that balances expressivity and efficiency, while highlighting limitations around fixed compression sizes and component usage for future generalization.

Abstract

Recent deep learning models for Long-term Time Series Forecasting (LTSF) often emphasize complex, handcrafted designs, while simpler architectures like linear models or MLPs have often outperformed these intricate solutions. In this paper, we revisit and organize the core ideas behind several key techniques, such as redundancy reduction and multi-scale modeling, which are frequently employed in advanced LTSF models. Our goal is to streamline these ideas for more efficient deep learning utilization. To this end, we introduce TimeCapsule, a model built around the principle of high-dimensional information compression that unifies these techniques in a generalized yet simplified framework. Specifically, we model time series as a 3D tensor, incorporating temporal, variate, and level dimensions, and leverage mode production to capture multi-mode dependencies while achieving dimensionality compression. We propose an internal forecast within the compressed representation domain, supported by the Joint-Embedding Predictive Architecture (JEPA), to monitor the learning of predictive representations. Extensive experiments on challenging benchmarks demonstrate the versatility of our method, showing that TimeCapsule can achieve state-of-the-art performance.

TimeCapsule: Solving the Jigsaw Puzzle of Long-Term Time Series Forecasting with Compressed Predictive Representations

TL;DR

TimeCapsule tackles long-term forecasting for multivariate time series by treating data as a 3D tensor (temporal, variate, level) and applying high-dimensional information compression to unify diverse modeling principles. The approach combines mode-specific multi-head self-attention (MoMSA) with low-rank compression, residual information back, and JEPA-based internal prediction to learn compact, predictive representations that can map back to the temporal domain via a lightweight decoder. Extensive experiments across ten datasets show competitive or state-of-the-art performance, with notable improvements on very long horizons and insights into when multi-level modeling is most beneficial. The work demonstrates the practicality of a simple, compression-centric design that balances expressivity and efficiency, while highlighting limitations around fixed compression sizes and component usage for future generalization.

Abstract

Recent deep learning models for Long-term Time Series Forecasting (LTSF) often emphasize complex, handcrafted designs, while simpler architectures like linear models or MLPs have often outperformed these intricate solutions. In this paper, we revisit and organize the core ideas behind several key techniques, such as redundancy reduction and multi-scale modeling, which are frequently employed in advanced LTSF models. Our goal is to streamline these ideas for more efficient deep learning utilization. To this end, we introduce TimeCapsule, a model built around the principle of high-dimensional information compression that unifies these techniques in a generalized yet simplified framework. Specifically, we model time series as a 3D tensor, incorporating temporal, variate, and level dimensions, and leverage mode production to capture multi-mode dependencies while achieving dimensionality compression. We propose an internal forecast within the compressed representation domain, supported by the Joint-Embedding Predictive Architecture (JEPA), to monitor the learning of predictive representations. Extensive experiments on challenging benchmarks demonstrate the versatility of our method, showing that TimeCapsule can achieve state-of-the-art performance.

Paper Structure

This paper contains 37 sections, 13 equations, 24 figures, 10 tables.

Figures (24)

  • Figure 1: We visualize the ETTh1 dataset as an example to illustrate the need to compress the information in different MvTS datasets' dimensions.
  • Figure 2: Categorization of advanced LTSF models into four groups based on their core techniques.
  • Figure 3: Overview of the TimeCapsule model. The original time series is transformed into a 3D representation by adding a level dimension, traversing through TransBlocks and tunnels (vanilla transformer blocks), and is then projected into the predictive space using JEPA. The compressed capsule is gradually recovered back into the real temporal domain.
  • Figure 4: We visualize ETTh1 dataset as a typical example to illustrate 512-96 predictions of different components (trend, stationary, deviation) of time series. All results are obtained under the recommended settings of the source code.
  • Figure 5: The top figures show the variation trend of JEPA loss without backpropagation, while the bottom figures show the effect of including JEPA loss as a training objective on performance.
  • ...and 19 more figures

Theorems & Definitions (1)

  • Definition 1: Mode Product