EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecasting
Zhiwei Zhang, Xinyi Du, Xuanchi Guo, Weihao Wang, Wenjuan Han
TL;DR
The paper addresses unstable inter-channel dynamics in multivariate time series forecasting with Transformers. It proposes EMAformer, which injects three inductive biases via channel, phase, and joint channel-phase embeddings into variate-tokenized inputs, preserving the backbone architecture. Empirically, EMAformer achieves state-of-the-art results on 12 real-world benchmarks, with average improvements of 2.73% in MSE and 5.15% in MAE, notably excelling on high-channel datasets and challenging PEMS tasks. This approach offers a practical, architecture-agnostic path to enhance Transformer-based time series forecasting, with publicly available code for reproducibility.
Abstract
Multivariate time series forecasting is crucial across a wide range of domains. While presenting notable progress for the Transformer architecture, iTransformer still lags behind the latest MLP-based models. We attribute this performance gap to unstable inter-channel relationships. To bridge this gap, we propose EMAformer, a simple yet effective model that enhances the Transformer with an auxiliary embedding suite, akin to armor that reinforces its ability. By introducing three key inductive biases, i.e., \textit{global stability}, \textit{phase sensitivity}, and \textit{cross-axis specificity}, EMAformer unlocks the further potential of the Transformer architecture, achieving state-of-the-art performance on 12 real-world benchmarks and reducing forecasting errors by an average of 2.73\% in MSE and 5.15\% in MAE. This significantly advances the practical applicability of Transformer-based approaches for multivariate time series forecasting. The code is available on https://github.com/PlanckChang/EMAformer.
