A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Anish Saha; Konstantin Shmakov

A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Anish Saha, Konstantin Shmakov

Abstract

In-context learning (ICL) allows a model to adapt at inference time by conditioning on examples rather than updating parameters. Existing time-series foundation models use implicit positional context, retrieval, or task-specific objectives, but rarely explicit instruction-conditioned demonstrations. We present a foundation model for instruction-conditioned in-context time-series tasks based on a quantile-regression T5 encoder-decoder. Historical examples and queries are encoded with a structured tokenization scheme that marks target series, covariates, context, and task-specific future information. A hierarchical Transformer with per-example encoding, example-level fusion, and cross-example attention conditions decoding on demonstration pairs, enabling forecasting and related tasks without task-specific fine-tuning. We train on large-scale real and synthetic time series using supervised forecasting plus self-supervised tasks, including imputation, reconstruction, classification, anomaly detection, and source demixing. This multi-task training learns a distribution over task mappings and improves adaptation to local structure at inference time. Across diverse datasets, frequencies, and horizons, our method outperforms strong foundation baselines on point and probabilistic forecasting benchmarks, including fev-bench and GIFT-Eval, while remaining competitive on classification and anomaly detection.

A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Abstract

Paper Structure (77 sections, 43 equations, 5 figures, 14 tables, 1 algorithm)

This paper contains 77 sections, 43 equations, 5 figures, 14 tables, 1 algorithm.

Introduction
Related Work
Methodology
Problem Formulation and ICL Setup
Data Representation and Granularity
Multivariate time series and covariates
Examples and Queries
In-context learning input
Structured Tokenization and Input Construction
Target and Covariate types
Structured In-context learning input
Amortized Meta-Learning Task Definition
Model Architecture
Base Time-Series Encoder $\phi$
Hierarchical Attention Fusion
...and 62 more sections

Figures (5)

Figure 1: Results of the GIFT-Eval benchmark: Aggregated scores of the overall benchmark. Lower values are better. "Zero-shot Models" are not trained on this data. The In-domain Models are partly trained on the dataset (Overlap:: Moirai 2.0 19%, TimesFM-2.5 10%, TTM: 16%).
Figure 2: Results of the fev-bench benchmark: Aggregated scores of the overall benchmark. Lower values are better. "Zero-shot Models" are not trained on this data.
Figure 3: Overall and long-term performance on the GIFT-Eval benchmark. (Train-evaluation overlap: Moirai 2.0 19%, TimesFM-2.5 10%, TTM 16%)
Figure 4: Term length performance on the GIFT-Eval benchmark. (Train-evaluation overlap: Moirai 2.0 19%, TimesFM-2.5 10%, TTM 16%)
Figure 5: Result on univariate and multivariate inputs on the GIFT-Eval benchmark. (Train-evaluation overlap: Moirai 2.0 19%, TimesFM-2.5 10%, TTM 16%)

A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Abstract

A Foundation Model for Instruction-Conditioned In-Context Time Series Tasks

Authors

Abstract

Table of Contents

Figures (5)