Hyperparameter Tuning MLPs for Probabilistic Time Series Forecasting
Kiran Madhusudhanan, Shayan Jawed, Lars Schmidt-Thieme
TL;DR
This work addresses the gap in hyperparameter optimization for time-series forecasting with neural models by focusing on time-series-specific hyperparameters such as context length and validation strategy. It analyzes the NLinear MLP variant and extends it to probabilistic forecasting within GluonTS, including distribution heads and gradient statistics. The authors introduce TSBench, the largest metadataset to date for probabilistic time-series forecasting, totaling $97{,}200$ evaluations across $20$ Monash datasets with $4{,}800$ configurations per dataset and multi-fidelity evaluation capabilities. They benchmark multiple HPO strategies (SMAC, HyperBand, BOHB, Random), demonstrate context-length and learning-rate importance, and show that a linear MLP is a robust baseline while TSBench enables effective transfer and learning-curve forecasting insights for HPO.
Abstract
Time series forecasting attempts to predict future events by analyzing past trends and patterns. Although well researched, certain critical aspects pertaining to the use of deep learning in time series forecasting remain ambiguous. Our research primarily focuses on examining the impact of specific hyperparameters related to time series, such as context length and validation strategy, on the performance of the state-of-the-art MLP model in time series forecasting. We have conducted a comprehensive series of experiments involving 4800 configurations per dataset across 20 time series forecasting datasets, and our findings demonstrate the importance of tuning these parameters. Furthermore, in this work, we introduce the largest metadataset for timeseries forecasting to date, named TSBench, comprising 97200 evaluations, which is a twentyfold increase compared to previous works in the field. Finally, we demonstrate the utility of the created metadataset on multi-fidelity hyperparameter optimization tasks.
