Table of Contents
Fetching ...

Can time series forecasting be automated? A benchmark and analysis

Anvitha Thirthapura Sreedhara, Joaquin Vanschoren

TL;DR

The paper introduces a comprehensive benchmark for time series forecasting across diverse datasets drawn from the Monash Time Series Forecasting Repository to systematically evaluate AutoGluon-Timeseries and sktime. It analyzes both frameworks under standardized training budgets and a tuned sktime pipeline, using SMAPE and MASE as primary metrics. Key findings show AutoGluon–Timeseries generally achieving stronger, more robust performance across frequencies and domains, with PatchTST and ensembling frequently contributing to gains, while certain sktime methods excel in specific settings. The work demonstrates the value of automated forecasting pipelines for method selection and highlights avenues for extending AutoML-based forecasting with richer datasets, meta-models, and transfer learning for improved deployment in practice.

Abstract

In the field of machine learning and artificial intelligence, time series forecasting plays a pivotal role across various domains such as finance, healthcare, and weather. However, the task of selecting the most suitable forecasting method for a given dataset is a complex task due to the diversity of data patterns and characteristics. This research aims to address this challenge by proposing a comprehensive benchmark for evaluating and ranking time series forecasting methods across a wide range of datasets. This study investigates the comparative performance of many methods from two prominent time series forecasting frameworks, AutoGluon-Timeseries, and sktime to shed light on their applicability in different real-world scenarios. This research contributes to the field of time series forecasting by providing a robust benchmarking methodology and facilitating informed decision-making when choosing forecasting methods for achieving optimal prediction.

Can time series forecasting be automated? A benchmark and analysis

TL;DR

The paper introduces a comprehensive benchmark for time series forecasting across diverse datasets drawn from the Monash Time Series Forecasting Repository to systematically evaluate AutoGluon-Timeseries and sktime. It analyzes both frameworks under standardized training budgets and a tuned sktime pipeline, using SMAPE and MASE as primary metrics. Key findings show AutoGluon–Timeseries generally achieving stronger, more robust performance across frequencies and domains, with PatchTST and ensembling frequently contributing to gains, while certain sktime methods excel in specific settings. The work demonstrates the value of automated forecasting pipelines for method selection and highlights avenues for extending AutoML-based forecasting with richer datasets, meta-models, and transfer learning for improved deployment in practice.

Abstract

In the field of machine learning and artificial intelligence, time series forecasting plays a pivotal role across various domains such as finance, healthcare, and weather. However, the task of selecting the most suitable forecasting method for a given dataset is a complex task due to the diversity of data patterns and characteristics. This research aims to address this challenge by proposing a comprehensive benchmark for evaluating and ranking time series forecasting methods across a wide range of datasets. This study investigates the comparative performance of many methods from two prominent time series forecasting frameworks, AutoGluon-Timeseries, and sktime to shed light on their applicability in different real-world scenarios. This research contributes to the field of time series forecasting by providing a robust benchmarking methodology and facilitating informed decision-making when choosing forecasting methods for achieving optimal prediction.
Paper Structure (24 sections, 8 figures, 15 tables)

This paper contains 24 sections, 8 figures, 15 tables.

Figures (8)

  • Figure 1: (a) Trend, Seasonality and (b) Stationarity in a time series data
  • Figure 4: Frequency of different models trained by AutoGluon
  • Figure 5: Wins and losses for all the sktime methods
  • Figure 6: CD diagram and violin plot for SMAPE
  • Figure 7: CD diagram and violin plot for MASE
  • ...and 3 more figures