TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting
Shiyu Wang, Haixu Wu, Xiaoming Shi, Tengge Hu, Huakun Luo, Lintao Ma, James Y. Zhang, Jun Zhou
TL;DR
TimeMixer introduces a fully MLP-based forecasting model that exploits a novel multiscale mixing paradigm. By disentangling past information into seasonal and trend components with Past-Decomposable-Mixing and by ensembling multiscale future predictions with Future-Multipredictor-Mixing, it achieves state-of-the-art results across both long-term and short-term tasks while remaining efficient. The approach is extensively validated on diverse real-world benchmarks, supported by thorough ablations and visualizations that illuminate the distinct roles of seasonal and trend channels and the benefits of scale-wise prediction. The work offers a practical, scalable solution for complex, non-stationary time-series forecasting and suggests future directions for integrating alternative mixing schemes and cross-dimension interactions.
Abstract
Time series forecasting is widely used in extensive applications, such as traffic planning and weather forecasting. However, real-world time series usually present intricate temporal variations, making forecasting extremely challenging. Going beyond the mainstream paradigms of plain decomposition and multiperiodicity analysis, we analyze temporal variations in a novel view of multiscale-mixing, which is based on an intuitive but important observation that time series present distinct patterns in different sampling scales. The microscopic and the macroscopic information are reflected in fine and coarse scales respectively, and thereby complex variations can be inherently disentangled. Based on this observation, we propose TimeMixer as a fully MLP-based architecture with Past-Decomposable-Mixing (PDM) and Future-Multipredictor-Mixing (FMM) blocks to take full advantage of disentangled multiscale series in both past extraction and future prediction phases. Concretely, PDM applies the decomposition to multiscale series and further mixes the decomposed seasonal and trend components in fine-to-coarse and coarse-to-fine directions separately, which successively aggregates the microscopic seasonal and macroscopic trend information. FMM further ensembles multiple predictors to utilize complementary forecasting capabilities in multiscale observations. Consequently, TimeMixer is able to achieve consistent state-of-the-art performances in both long-term and short-term forecasting tasks with favorable run-time efficiency.
