AWEMixer: Adaptive Wavelet-Enhanced Mixer Network for Long-Term Time Series Forecasting
Qianyang Li, Xingjun Zhang, Peng Tao, Shaoxun Wang, Yancheng Pan, Jia Wei
TL;DR
AWEMixer tackles long-term forecasting for non-stationary, multi-scale IoT time series by integrating an adaptive wavelet-based frequency stream with a Mixer-style temporal backbone. The model introduces a Frequency Router to dynamically weight wavelet subbands and a Coherent Gated Fusion block to selectively fuse time–frequency features via cross-attention and gating, all within a dual-stream, cross-scale architecture that maintains linear complexity. Empirical results on seven benchmarks show state-of-the-art performance against Transformer and MLP-based baselines, with thorough ablations confirming the contributions of adaptive frequency weighting and gated fusion. The approach offers precise time–frequency localization, robustness to noise, and practical scalability for ultra-long horizons, with potential extensions to adaptive wavelets and anomaly detection.
Abstract
Forecasting long-term time series in IoT environments remains a significant challenge due to the non-stationary and multi-scale characteristics of sensor signals. Furthermore, error accumulation causes a decrease in forecast quality when predicting further into the future. Traditional methods are restricted to operate in time-domain, while the global frequency information achieved by Fourier transform would be regarded as stationary signals leading to blur the temporal patterns of transient events. We propose AWEMixer, an Adaptive Wavelet-Enhanced Mixer Network including two innovative components: 1) a Frequency Router designs to utilize the global periodicity pattern achieved by Fast Fourier Transform to adaptively weight localized wavelet subband, and 2) a Coherent Gated Fusion Block to achieve selective integration of prominent frequency features with multi-scale temporal representation through cross-attention and gating mechanism, which realizes accurate time-frequency localization while remaining robust to noise. Seven public benchmarks validate that our model is more effective than recent state-of-the-art models. Specifically, our model consistently achieves performance improvement compared with transformer-based and MLP-based state-of-the-art models in long-sequence time series forecasting. Code is available at https://github.com/hit636/AWEMixer
