Fredformer: Frequency Debiased Transformer for Time Series Forecasting
Xihao Piao, Zheng Chen, Taichi Murayama, Yasuko Matsubara, Yasushi Sakurai
TL;DR
Fredformer tackles frequency bias in Transformer-based time series forecasting, where low-frequency components tend to dominate learning due to energy concentration. It introduces a frequency-debiased Transformer built on a DFT-to-IDFT backbone, frequency refinement/normalization, and local independent modeling to equalize learning across frequency bands, plus a Nyström-based lightweight variant for efficiency. Empirical results on eight real-world datasets show state-of-the-art forecasting accuracy and clear reduction in frequency bias compared with baselines, with ablations confirming the importance of each component. The method enables more reliable short- and mid-term forecasts in diverse domains and offers a practical path toward scalable frequency-aware Transformers.
Abstract
The Transformer model has shown leading performance in time series forecasting. Nevertheless, in some complex scenarios, it tends to learn low-frequency features in the data and overlook high-frequency features, showing a frequency bias. This bias prevents the model from accurately capturing important high-frequency data features. In this paper, we undertook empirical analyses to understand this bias and discovered that frequency bias results from the model disproportionately focusing on frequency features with higher energy. Based on our analysis, we formulate this bias and propose Fredformer, a Transformer-based framework designed to mitigate frequency bias by learning features equally across different frequency bands. This approach prevents the model from overlooking lower amplitude features important for accurate forecasting. Extensive experiments show the effectiveness of our proposed approach, which can outperform other baselines in different real-world time-series datasets. Furthermore, we introduce a lightweight variant of the Fredformer with an attention matrix approximation, which achieves comparable performance but with much fewer parameters and lower computation costs. The code is available at: https://github.com/chenzRG/Fredformer
