ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning
Jiecheng Lu, Xu Han, Shihao Yang
TL;DR
The paper tackles the challenge of multivariate long-term time series forecasting by identifying that existing multivariate Transformers struggle to model series-wise differences. It proposes ARM, a modular framework with Adaptive Univariate Effect Learning (AUEL), Random Dropping (RD), and Multi-kernel Local Smoothing (MKLS), to disentangle univariate patterns, decouple inter-series dependencies during training, and construct flexible local temporal representations. The approach yields state-of-the-art results on nine datasets with only a modest increase in computation and is transferable to other LTSF architectures beyond vanilla Transformers. Collectively, AUEL, RD, and MKLS enable robust, scalable handling of diverse multivariate time series, offering practical improvements for real-world forecasting tasks.
Abstract
Long-term time series forecasting (LTSF) is important for various domains but is confronted by challenges in handling the complex temporal-contextual relationships. As multivariate input models underperforming some recent univariate counterparts, we posit that the issue lies in the inefficiency of existing multivariate LTSF Transformers to model series-wise relationships: the characteristic differences between series are often captured incorrectly. To address this, we introduce ARM: a multivariate temporal-contextual adaptive learning method, which is an enhanced architecture specifically designed for multivariate LTSF modelling. ARM employs Adaptive Univariate Effect Learning (AUEL), Random Dropping (RD) training strategy, and Multi-kernel Local Smoothing (MKLS), to better handle individual series temporal patterns and correctly learn inter-series dependencies. ARM demonstrates superior performance on multiple benchmarks without significantly increasing computational costs compared to vanilla Transformer, thereby advancing the state-of-the-art in LTSF. ARM is also generally applicable to other LTSF architecture beyond vanilla Transformer.
