Zero-shot forecasting of chaotic systems
Yuanzhao Zhang, William Gilpin
TL;DR
Problem: Can zero-shot forecasting with large, domain-agnostic time-series models generalize to chaotic dynamical systems without task-specific training? Approach: Benchmark Chronos on $135$ chaotic systems, using short-term forecast metrics like $VPT$ and $sMAPE$, and long-term attractor metrics such as fractal dimension $d_{frac}$ and KL divergence $D_{stsp}$, across varying context lengths and with shuffled-context tests to probe in-context learning. Findings: larger Chronos models achieve short-term forecasts competitive with fully-trained baselines, excel at long-term attractor reconstruction, and exhibit in-context learning via context parroting, even when context order is scrambled; performance degrades under nonstationarity. Significance: demonstrates a scalable, training-free forecasting paradigm for complex nonlinear dynamics with potential implications for SciML and nonlinear dynamics, and points to future work on robustness and task-tuned adaptations.
Abstract
Time-series forecasting is a challenging problem that traditionally requires specialized models custom-trained for the specific task at hand. Recently, inspired by the success of large language models, foundation models pre-trained on vast amounts of time-series data from diverse domains have emerged as a promising candidate for general-purpose time-series forecasting. The defining characteristic of these foundation models is their ability to perform zero-shot learning, that is, forecasting a new system from limited context data without explicit re-training or fine-tuning. Here, we evaluate whether the zero-shot learning paradigm extends to the challenging task of forecasting chaotic systems. Across 135 distinct chaotic dynamical systems and $10^8$ timepoints, we find that foundation models produce competitive forecasts compared to custom-trained models (including NBEATS, TiDE, etc.), particularly when training data is limited. Interestingly, even after point forecasts fail, large foundation models are able to preserve the geometric and statistical properties of the chaotic attractors. We attribute this success to foundation models' ability to perform in-context learning and identify context parroting as a simple mechanism used by these models to capture the long-term behavior of chaotic dynamical systems. Our results highlight the potential of foundation models as a tool for probing nonlinear and complex systems.
