The Capacity and Robustness Trade-off: Revisiting the Channel Independent Strategy for Multivariate Time Series Forecasting
Lu Han, Han-Jia Ye, De-Chuan Zhan
TL;DR
The paper investigates why Channel Independent (CI) training often beats Channel Dependent (CD) in multivariate time series forecasting. Through extensive experiments across nine real-world datasets and multiple algorithms, CI typically yields lower MAE/MSE and less variance, challenging the assumption that modeling channel correlations via CD is always beneficial. The authors provide theoretical and empirical analysis on a linear model showing CI trades capacity for robustness to distribution drift, manifested in channel-wise ACF differences, and propose Predict Residuals with Regularization (PRReg) to boost CD performance. They offer practical guidelines and discuss factors that influence CI/CD performance, aiming to inform the design of more robust MTS forecasting approaches.
Abstract
Multivariate time series data comprises various channels of variables. The multivariate forecasting models need to capture the relationship between the channels to accurately predict future values. However, recently, there has been an emergence of methods that employ the Channel Independent (CI) strategy. These methods view multivariate time series data as separate univariate time series and disregard the correlation between channels. Surprisingly, our empirical results have shown that models trained with the CI strategy outperform those trained with the Channel Dependent (CD) strategy, usually by a significant margin. Nevertheless, the reasons behind this phenomenon have not yet been thoroughly explored in the literature. This paper provides comprehensive empirical and theoretical analyses of the characteristics of multivariate time series datasets and the CI/CD strategy. Our results conclude that the CD approach has higher capacity but often lacks robustness to accurately predict distributionally drifted time series. In contrast, the CI approach trades capacity for robust prediction. Practical measures inspired by these analyses are proposed to address the capacity and robustness dilemma, including a modified CD method called Predict Residuals with Regularization (PRReg) that can surpass the CI strategy. We hope our findings can raise awareness among researchers about the characteristics of multivariate time series and inspire the construction of better forecasting models.
