This Time is Different: An Observability Perspective on Time Series Foundation Models

Ben Cohen; Emaad Khwaja; Youssef Doubli; Salahidine Lemaachi; Chris Lettieri; Charles Masson; Hugo Miccinilli; Elise Ramé; Qiqi Ren; Afshin Rostamizadeh; Jean Ogier du Terrail; Anna-Monica Toon; Kan Wang; Stephan Xie; Zongzhe Xu; Viktoriya Zhukova; David Asker; Ameet Talwalkar; Othmane Abou-Amal

This Time is Different: An Observability Perspective on Time Series Foundation Models

Ben Cohen, Emaad Khwaja, Youssef Doubli, Salahidine Lemaachi, Chris Lettieri, Charles Masson, Hugo Miccinilli, Elise Ramé, Qiqi Ren, Afshin Rostamizadeh, Jean Ogier du Terrail, Anna-Monica Toon, Kan Wang, Stephan Xie, Zongzhe Xu, Viktoriya Zhukova, David Asker, Ameet Talwalkar, Othmane Abou-Amal

TL;DR

This work develops Toto, a decoder-only time-series foundation model engineered for zero-shot forecasting on observability data, featuring per-patch causal normalization, time-variate attention, and a robust Student-T mixture output. It is trained on a colossal mixed corpus including Datadog telemetry, public datasets, and synthetic data, and is evaluated against a new large-scale observability benchmark, Boom, as well as established multi-domain benchmarks. Toto achieves state-of-the-art results across Boom, GIFT-Eval, and LSF, with notable improvements in probabilistic calibration and robustness to heavy-tailed distributions. By open-sourcing both Toto and Boom under Apache 2.0, the authors aim to accelerate practical adoption and further research in scalable, domain-specific time-series forecasting for observability workloads.

Abstract

We introduce Toto, a time series forecasting foundation model with 151 million parameters. Toto uses a modern decoder-only architecture coupled with architectural innovations designed to account for specific challenges found in multivariate observability time series data. Toto's pre-training corpus is a mixture of observability data, open datasets, and synthetic data, and is 4-10$\times$ larger than those of leading time series foundation models. Additionally, we introduce BOOM, a large-scale benchmark consisting of 350 million observations across 2,807 real-world time series. For both Toto and BOOM, we source observability data exclusively from Datadog's own telemetry and internal observability metrics. Extensive evaluations demonstrate that Toto achieves state-of-the-art performance on both BOOM and on established general purpose time series forecasting benchmarks. Toto's model weights, inference code, and evaluation scripts, as well as BOOM's data and evaluation code, are all available as open source under the Apache 2.0 License available at https://huggingface.co/Datadog/Toto-Open-Base-1.0 and https://github.com/DataDog/toto.

This Time is Different: An Observability Perspective on Time Series Foundation Models

TL;DR

Abstract

This Time is Different: An Observability Perspective on Time Series Foundation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)