TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning

Andreas Auer; Patrick Podest; Daniel Klotz; Sebastian Böck; Günter Klambauer; Sepp Hochreiter

TL;DR

TiRex addresses the challenge of zero-shot forecasting across long and short horizons by introducing a decoder-only xLSTM backbone with Contiguous Patch Masking (CPM) and targeted data augmentations. It enables coherent multi-patch horizon predictions and produces probabilistic forecasts by predicting nine quantiles trained with a quantile loss. The training corpus is large and diverse, combining Chronos data, synthetic Gaussian-process data, and GiftEval pre-training data. The model achieves state-of-the-art zero-shot performance on the GiftEval-ZS and Chronos-ZS benchmarks with a compact 35M-parameter footprint and fast inference, outperforming significantly larger transformer-based models. The work highlights the effectiveness of state-tracking LSTM variants for time-series in-context learning and provides ablations confirming the benefits of CPM, augmentation, and backbone design for long-horizon uncertainty propagation.
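The quantile (pinball) loss named in the summary can be illustrated with a minimal sketch. The summary only states that nine quantiles are predicted; the evenly spaced levels 0.1 to 0.9 used here, and the function names `pinball_loss` and `mean_quantile_loss`, are assumptions for illustration and not taken from the paper.

```python
from typing import List, Sequence

# ASSUMPTION: the source says "nine quantiles" but does not list the levels.
# Evenly spaced levels 0.1, 0.2, ..., 0.9 are used here for illustration.
QUANTILE_LEVELS: List[float] = [i / 10 for i in range(1, 10)]


def pinball_loss(y_true: float, y_pred: float, q: float) -> float:
    """Quantile (pinball) loss for one target and one predicted q-quantile."""
    diff = y_true - y_pred
    # Under-prediction (diff > 0) is weighted by q,
    # over-prediction (diff < 0) by (1 - q).
    return max(q * diff, (q - 1.0) * diff)


def mean_quantile_loss(
    targets: Sequence[float],
    quantile_preds: Sequence[Sequence[float]],
    levels: Sequence[float] = QUANTILE_LEVELS,
) -> float:
    """Average pinball loss over all time steps and quantile levels.

    quantile_preds[t][k] is the predicted levels[k]-quantile at step t.
    """
    total = 0.0
    count = 0
    for y, preds in zip(targets, quantile_preds):
        if len(preds) != len(levels):
            raise ValueError("one prediction per quantile level is required")
        for q, p in zip(levels, preds):
            total += pinball_loss(y, p, q)
            count += 1
    return total / count
```

For a high quantile such as 0.9, predicting too low is penalized more heavily than predicting too high, which pushes the predicted value toward the upper tail of the target distribution.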

Abstract

In-context learning, the ability of large language models to perform tasks using only examples provided in the prompt, has recently been adapted for time series forecasting. This paradigm enables zero-shot prediction, where past values serve as context for forecasting future values, making powerful forecasting tools accessible to non-experts and improving performance when training data are scarce. Most existing zero-shot forecasting approaches rely on transformer architectures, which, despite their success in language, often fall short of expectations in time series forecasting, where recurrent models like LSTMs frequently have the edge. Conversely, while LSTMs are well-suited for time series modeling due to their state-tracking capabilities, they lack strong in-context learning abilities. We introduce TiRex, which closes this gap by leveraging xLSTM, an enhanced LSTM with competitive in-context learning skills. Unlike transformers, state-space models, or parallelizable RNNs such as RWKV, TiRex retains state-tracking, a critical property for long-horizon forecasting. To further facilitate its state-tracking ability, we propose a training-time masking strategy called CPM. TiRex sets a new state of the art in zero-shot time series forecasting on the HuggingFace benchmarks GiftEval and Chronos-ZS, outperforming significantly larger models including TabPFN-TS (Prior Labs), Chronos Bolt (Amazon), TimesFM (Google), and Moirai (Salesforce) across both short- and long-term forecasts.
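As a rough illustration of what a contiguous patch mask could look like at training time, the sketch below hides one random run of adjacent patches in a series. This is an assumption-laden toy, not the authors' procedure: the text above only names CPM as a training-time masking strategy, so the sampling scheme, the parameter `max_masked`, the fill value, and both function names are hypothetical.

```python
import random
from typing import List, Optional, Sequence


def contiguous_patch_mask(
    num_patches: int, max_masked: int, rng: Optional[random.Random] = None
) -> List[bool]:
    """Return a per-patch mask with exactly one contiguous run set to True.

    ASSUMPTION: run length and start are drawn uniformly; the paper's
    actual sampling scheme is not described in this section.
    """
    if not 1 <= max_masked <= num_patches:
        raise ValueError("max_masked must be between 1 and num_patches")
    rng = rng or random.Random()
    run_length = rng.randint(1, max_masked)            # inclusive bounds
    start = rng.randint(0, num_patches - run_length)   # run fits in the series
    return [start <= i < start + run_length for i in range(num_patches)]


def apply_patch_mask(
    series: Sequence[float],
    patch_size: int,
    mask: Sequence[bool],
    fill: Optional[float] = float("nan"),
) -> list:
    """Replace every value inside a masked patch with `fill`."""
    out = list(series)
    for p, masked in enumerate(mask):
        if masked:
            stop = min((p + 1) * patch_size, len(out))
            for i in range(p * patch_size, stop):
                out[i] = fill
    return out
```

Masking several adjacent patches at once forces a model to bridge a gap longer than one patch during training, which resembles the situation it faces when forecasting a multi-patch horizon.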
