Efficient Time Series Forecasting via Hyper-Complex Models and Frequency Aggregation
Eyal Yakir, Dor Tsur, Haim Permuter
TL;DR
Time-series forecasting with long-range dependencies is enhanced by FIA-Net, which uses STFT windowing to decompose sequences and two backbones: WM-MLP and HC-MLP, to aggregate information across windows. The HC-MLP extends frequency-domain learning with hyper-complex algebra (octonions, p=4), reducing parameters; Top-$M$ frequency compression further improves efficiency, achieving $O(L \log(L/p))$ forward-pass complexity. Empirically, FIA-Net outperforms SoTA on Weather, Exchange, Traffic, Electricity, ETTh1, and ETTm1 benchmarks, with average MAE improvements of 5.4% and RMSE improvements of 3.8%; HC-MLP offers competitive results with far fewer parameters, especially for short horizons. The work provides a practical, scalable approach for nonstationary time-series forecasting and points to future work leveraging Kramers-Kronig relations and broader HC bases.
Abstract
Time series forecasting is a long-standing problem in statistics and machine learning. One of the key challenges is processing sequences with long-range dependencies. To that end, a recent line of work applied the short-time Fourier transform (STFT), which partitions the sequence into multiple subsequences and applies a Fourier transform to each separately. We propose the Frequency Information Aggregation (FIA)-Net, which is based on a novel complex-valued MLP architecture that aggregates adjacent window information in the frequency domain. To further increase the receptive field of the FIA-Net, we treat the set of windows as hyper-complex (HC) valued vectors and employ HC algebra to efficiently combine information from all STFT windows altogether. Using the HC-MLP backbone allows for improved handling of sequences with long-term dependence. Furthermore, due to the nature of HC operations, the HC-MLP uses up to three times fewer parameters than the equivalent standard window aggregation method. We evaluate the FIA-Net on various time-series benchmarks and show that the proposed methodologies outperform existing state of the art methods in terms of both accuracy and efficiency. Our code is publicly available on https://anonymous.4open.science/r/research-1803/.
