Position: Universal Time Series Foundation Models Rest on a Category Error
Xilin Dai, Wanxu Cai, Zhijian Xu, Qiang Xu
TL;DR
The paper argues that universal time-series foundation models embody a category error by treating diverse time-series domains as a single modality, which leads to negative transfer and reliance on expensive Generic Filters. It formalizes this with the Autoregressive Blindness Bound, showing history-only predictors cannot anticipate unobserved interventions $U_t$, and introduces the Causal Control Agent architecture (Perceiver-Controller-Solver) that leverages external context via a JIT Solver to handle regime shifts. It advocates shifting benchmarks from zero-shot accuracy to drift-adaptation metrics like Time-to-Recovery (TTR) and Intervention Recall, and it promotes architectural separation, using Perceivers for perception and Agents for reasoning, guided by a control-theoretic mindset. The proposed paradigm aims to produce robust forecasting systems that continually adapt to interventions, rather than rely on static, universal priors, with practical impact on critical infrastructures and high-stakes deployments.
Abstract
This position paper argues that the pursuit of "Universal Foundation Models for Time Series" rests on a fundamental category error, mistaking a structural Container for a semantic Modality. We contend that because time series hold incompatible generative processes (e.g., finance vs. fluid dynamics), monolithic models degenerate into expensive "Generic Filters" that fail to generalize under distributional drift. To address this, we introduce the "Autoregressive Blindness Bound," a theoretical limit proving that history-only models cannot predict intervention-driven regime shifts. We advocate replacing universality with a Causal Control Agent paradigm, where an agent leverages external context to orchestrate a hierarchy of specialized solvers, from frozen domain experts to lightweight Just-in-Time adaptors. We conclude by calling for a shift in benchmarks from "Zero-Shot Accuracy" to "Drift Adaptation Speed" to prioritize robust, control-theoretic systems.
