Sundial: A Family of Highly Capable Time Series Foundation Models

Yong Liu; Guo Qin; Zhiyuan Shi; Zhi Chen; Caiyin Yang; Xiangdong Huang; Jianmin Wang; Mingsheng Long

Sundial: A Family of Highly Capable Time Series Foundation Models

Yong Liu, Guo Qin, Zhiyuan Shi, Zhi Chen, Caiyin Yang, Xiangdong Huang, Jianmin Wang, Mingsheng Long

TL;DR

Sundial addresses the non-determinism of time series by learning a flexible generative model that conditions on history to sample from $p(x_{t+1:t+f}|oldsymbol{h}_t)$. It uses TimeFlow Loss within a flow-matching framework to train a decoder-only Transformer on continuous-valued sequences without discrete tokenization, enabling multiple plausible futures with fast test-time generation. The approach is bolstered by patch-based tokenization, RoPE-enhanced attention, and a trillion-point TimeBench pre-training corpus, delivering state-of-the-art zero-shot performance on both point and probabilistic benchmarks. Together, these contributions unlock scalable, reliable, and efficient generative forecasting for real-world decision-making across domains such as weather, energy, and finance.

Abstract

We introduce Sundial, a family of native, flexible, and scalable time series foundation models. To predict the next-patch's distribution, we propose a TimeFlow Loss based on flow-matching, which facilitates native pre-training of Transformers on continuous-valued time series without discrete tokenization. Conditioned on arbitrary-length time series, our models are pre-trained without specifying any prior distribution and can generate multiple probable predictions, achieving more flexibility in representation learning than using parametric densities. Towards time series foundation models, we leverage minimal but crucial adaptations of Transformers and curate TimeBench with one trillion time points, comprising mostly real-world datasets and synthetic data. By mitigating mode collapse via TimeFlow Loss, we pre-train a family of Sundial models on TimeBench, which achieve unprecedented model capacity and generalization performance. In addition to excellent scalability, Sundial achieves state-of-the-art results on both point and probabilistic forecasting benchmarks with a just-in-time inference speed, i.e., making zero-shot predictions within a few milliseconds. We believe that Sundial's pioneering generative forecasting capability can improve model reliability in real-world decision-making. Code is available at: https://github.com/thuml/Sundial.

Sundial: A Family of Highly Capable Time Series Foundation Models

TL;DR

Abstract

Sundial: A Family of Highly Capable Time Series Foundation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (15)