TimeSieve: Extracting Temporal Dynamics through Information Bottlenecks
Ninghui Feng, Songning Lai, Jiayu Yang, Fobao Zhou, Zhenxiao Yin, Hang Zhao
TL;DR
TimeSieve tackles the challenges of manual hyperparameter tuning and distinguishing signal from redundant features in time-series forecasting by fusing wavelet-based multi-scale decomposition with an information bottleneck module. It preprocesses data with a Wavelet Decomposition Block, filters redundancy via an Information Filtering and Compression Block guided by the Information Bottleneck objective, and reconstructs with a Wavelet Reconstruction Block before forecasting with an MLP; the IB objective balances maximizing mutual information with the target and minimizing information about the input, formalized as $ \max I(y; z) - \min I(x; z)$, with the IFCB loss incorporating $D_{KL}$ terms. The approach achieves state-of-the-art performance on approximately 70% of benchmarks, especially on datasets with strong seasonal redundancy, and demonstrates robust generalization across diverse domains. Overall, TimeSieve offers a practical, parameter-efficient framework for reliable, scalable time-series forecasting that leverages multi-scale structure and information-theoretic filtering.
Abstract
Time series forecasting has become an increasingly popular research area due to its critical applications in various real-world domains such as traffic management, weather prediction, and financial analysis. Despite significant advancements, existing models face notable challenges, including the necessity of manual hyperparameter tuning for different datasets, and difficulty in effectively distinguishing signal from redundant features in data characterized by strong seasonality. These issues hinder the generalization and practical application of time series forecasting models. To solve this issues, we propose an innovative time series forecasting model TimeSieve designed to address these challenges. Our approach employs wavelet transforms to preprocess time series data, effectively capturing multi-scale features without the need for additional parameters or manual hyperparameter tuning. Additionally, we introduce the information bottleneck theory that filters out redundant features from both detail and approximation coefficients, retaining only the most predictive information. This combination reduces significantly improves the model's accuracy. Extensive experiments demonstrate that our model outperforms existing state-of-the-art methods on 70% of the datasets, achieving higher predictive accuracy and better generalization across diverse datasets. Our results validate the effectiveness of our approach in addressing the key challenges in time series forecasting, paving the way for more reliable and efficient predictive models in practical applications. The code for our model is available at https://github.com/xll0328/TimeSieve.
