Monash Time Series Forecasting Archive
Rakshitha Godahewa, Christoph Bergmeir, Geoffrey I. Webb, Rob J. Hyndman, Pablo Montero-Manso
TL;DR
The paper addresses the lack of benchmarks for global time series forecasting by introducing the Monash Time Series Forecasting Archive, a comprehensive collection of 20 datasets (with 50 variations) spanning varied domains and frequencies, plus a flexible tsf data format and accompanying tooling. It characterizes the datasets through feature analysis using tsfeatures and catch22, and provides baseline forecast evaluations across eight error metrics for seven forecasting methods including a globally trained model. Key contributions include the dataset archive, a standardized data format, feature-space insights, and publicly available baseline results and code to enable rigorous benchmarking of future global forecasting approaches. The work is significant for practitioners and researchers aiming to develop and evaluate cross-series forecasting methods with realistic, heterogeneous data and reproducible evaluation pipelines.
Abstract
Many businesses and industries nowadays rely on large quantities of time series data making time series forecasting an important research area. Global forecasting models that are trained across sets of time series have shown a huge potential in providing accurate forecasts compared with the traditional univariate forecasting models that work on isolated series. However, there are currently no comprehensive time series archives for forecasting that contain datasets of time series from similar sources available for the research community to evaluate the performance of new global forecasting algorithms over a wide variety of datasets. In this paper, we present such a comprehensive time series forecasting archive containing 20 publicly available time series datasets from varied domains, with different characteristics in terms of frequency, series lengths, and inclusion of missing values. We also characterise the datasets, and identify similarities and differences among them, by conducting a feature analysis. Furthermore, we present the performance of a set of standard baseline forecasting methods over all datasets across eight error metrics, for the benefit of researchers using the archive to benchmark their forecasting algorithms.
