Ister: Inverted Seasonal-Trend Decomposition Transformer for Explainable Multivariate Time Series Forecasting
Fanpu Cao, Shu Yang, Zhengjian Chen, Ye Liu, Laizhong Cui
TL;DR
Ister tackles long-horizon multivariate forecasting by pairing a hierarchical seasonal-trend decomposition with a Dual Transformer backbone and a novel Dot-attention mechanism. The model first decomposes time series into seasonal and trend parts, further splitting seasonal signals into multiple periodic components, and then jointly models inter-series dependencies and multi-periodicity to improve predictive accuracy. Dot-attention provides linear-time channel weighting and tangible interpretability of component contributions, while enabling efficient inference. Experiments on six real-world datasets show state-of-the-art results, with up to 10% improvements in MSE, and visualizable insights into which components drive predictions, making Ister practical for real-world deployment and analysis.
Abstract
In long-term time series forecasting, Transformer-based models have achieved great success, due to its ability to capture long-range dependencies. However, existing models face challenges in identifying critical components for prediction, leading to limited interpretability and suboptimal performance. To address these issues, we propose the Inverted Seasonal-Trend Decomposition Transformer (Ister), a novel Transformer-based model for multivariate time series forecasting. Ister decomposes time series into seasonal and trend components, further modeling multi-periodicity and inter-series dependencies using a Dual Transformer architecture. We introduce a novel Dot-attention mechanism that improves interpretability, computational efficiency, and predictive accuracy. Comprehensive experiments on benchmark datasets demonstrate that Ister outperforms existing state-of-the-art models, achieving up to 10% improvement in MSE. Moreover, Ister enables intuitive visualization of component contributions, shedding lights on model's decision process and enhancing transparency in prediction results.
