Analyzing the retraining frequency of global forecasting models: towards more stable forecasting systems
Marco Zanotti
TL;DR
This study investigates how retraining frequency affects the stability of global forecasting models across large-scale datasets (M4, M5, VN1). It introduces Scaled Multi-Quantile Change (SMQC), a model-agnostic metric for probabilistic stability, and demonstrates that less frequent retraining often preserves or enhances both forecast stability and accuracy, especially for global models. The work contrasts global models with local baselines, showing that cross-learning yields robustness to update frequency and can reduce computational costs substantially. Practical guidelines for stability-aware retraining schedules are provided, with daily data benefiting from roughly 21–30 day cycles and weekly data up to 8 weeks, along with a shift toward prioritizing stability alongside accuracy. The findings have implications for deploying sustainable, stable forecasting systems in retail, energy, and supply-chain contexts.
Abstract
Forecast stability, that is, the consistency of predictions over time, is essential in business settings where sudden shifts in forecasts can disrupt planning and erode trust in predictive systems. Despite its importance, stability is often overlooked in favor of accuracy. In this study, we evaluate the stability of point and probabilistic forecasts across several retraining scenarios using three large forecastingdatasets and ten different global forecasting models. To analyze stability in the probabilistic setting, we propose a new model-agnostic, distribution-free, and scale-free metric that measuresprobabilistic stability: the Scaled Multi-Quantile Change (SMQC). The results show that less frequent retraining not only preserves but often improves forecast stability, challenging the need for frequent retraining. Moreover, the study shows that accuracy and stability are not necessarily conflicting objectives when adopting a global modeling approach. The study promotes a shift toward stability-aware forecasting practices, proposing a new metric to evaluate forecast stability effectively in probabilistic settings, and offering practical guidelines for building more stable and sustainable forecasting systems.
