Generalisation Bounds of Zero-Shot Economic Forecasting using Time Series Foundation Models
Jittarin Jetwiriyanon, Teo Susnjak, Surangika Ranathunga
TL;DR
This paper assesses zero-shot forecasting with Time Series Foundation Models (TSFMs) for macroeconomic indicators, using New Zealand data as a challenging testbed. By evaluating TimeGPT-1, Chronos, and Moirai in a purely univariate, pre-trained setting against classical baselines (Persistence, ARIMA, LSBoost, and Factor models), the study demonstrates that Moirai-based architectures can match or exceed traditional benchmarks during stable periods and exhibit notable resilience during shocks, though performance can degrade without domain covariates. Key findings show Moirai often delivering the lowest RMSE and good calibration across horizons, with Chronos also performing strongly, while TimeGPT exhibits mixed results. The results offer practical guidance on when zero-shot deployments are viable for macro monitoring and highlight the need for multivariate extensions, modest fine-tuning, and calibrated predictive distributions to handle regime changes and crises more robustly.
Abstract
This study investigates zero-shot forecasting capabilities of Time Series Foundation Models (TSFMs) for macroeconomic indicators. We apply TSFMs to forecasting economic indicators under univariate conditions, bypassing the need for train bespoke econometric models using and extensive training datasets. Our experiments were conducted on a case study dataset, without additional customisation. We rigorously back-tested three state-of-the-art TSFMs (Chronos, TimeGPT and Moirai) under data-scarce conditions and structural breaks. Our results demonstrate that appropriately engineered TSFMs can internalise rich economic dynamics, accommodate regime shifts, and deliver well-behaved uncertainty estimates out of the box, while matching state-of-the-art multivariate models on this domain. Our findings suggest that, without any fine-tuning, TSFMs can match or exceed classical models during stable economic conditions. However, they are vulnerable to degradation in performances during periods of rapid shocks. The findings offer guidance to practitioners on when zero-shot deployments are viable for macroeconomic monitoring and strategic planning.
