An Open-Source and Reproducible Implementation of LSTM and GRU Networks for Time Series Forecasting

Gissel Velarde; Pedro Branez; Alejandro Bueno; Rodrigo Heredia; Mateo Lopez-Ledezma

An Open-Source and Reproducible Implementation of LSTM and GRU Networks for Time Series Forecasting

Gissel Velarde, Pedro Branez, Alejandro Bueno, Rodrigo Heredia, Mateo Lopez-Ledezma

TL;DR

The paper addresses the need for reproducible open-source baselines in time-series forecasting with LSTM and GRU networks. It presents an end-to-end implementation and evaluation on two datasets—a real financial BANKEX series and a synthetic Activities series—using RMSE and DA to quantify predictive accuracy and directional correctness. Key findings show that LSTM and GRU substantially improve forecasts over a naive baseline on the Activities dataset, while improvements on BANKEX are not evident, highlighting data-dependent performance and the importance of hyperparameter tuning. The work contributes an openly available, reproducible framework and datasets to enable future benchmarking and comparisons across forecasting methods, with implications for practitioners seeking transparent evaluation in time-series forecasting.

Abstract

This paper introduces an open-source and reproducible implementation of Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) Networks for time series forecasting. We evaluated LSTM and GRU networks because of their performance reported in related work. We describe our method and its results on two datasets. The first dataset is the S&P BSE BANKEX, composed of stock time series (closing prices) of ten financial institutions. The second dataset, called Activities, comprises ten synthetic time series resembling weekly activities with five days of high activity and two days of low activity. We report Root Mean Squared Error (RMSE) between actual and predicted values, as well as Directional Accuracy (DA). We show that a single time series from a dataset can be used to adequately train the networks if the sequences in the dataset contain patterns that repeat, even with certain variation, and are properly processed. For 1-step ahead and 20-step ahead forecasts, LSTM and GRU networks significantly outperform a baseline on the Activities dataset. The baseline simply repeats the last available value. On the stock market dataset, the networks perform just like the baseline, possibly due to the nature of these series. We release the datasets used as well as the implementation with all experiments performed to enable future comparisons and to make our research reproducible.

An Open-Source and Reproducible Implementation of LSTM and GRU Networks for Time Series Forecasting

TL;DR

Abstract

An Open-Source and Reproducible Implementation of LSTM and GRU Networks for Time Series Forecasting

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)