MOMENT: A Family of Open Time-series Foundation Models
Mononito Goswami, Konrad Szafer, Arjun Choudhry, Yifu Cai, Shuo Li, Artur Dubrawski
TL;DR
MOMENT presents the first open family of time-series foundation models trained from a large, diverse public data pile called The Time Series Pile, addressing the lack of cohesive public ts data and evaluation benchmarks. It uses a patch-based transformer with masked time-series modeling to pre-train representations that are effective across forecasting, classification, anomaly detection, and imputation, even under zero-shot and limited supervision. The authors provide a rigorous benchmark extending prior work and show that MOMENT can outperform several baselines on multiple tasks, while also offering insights into model scaling and cross-modal transfer. They emphasize open science by releasing data, models, and code, and suggest future work in multi-modal time-series and causal forecasting objectives. Overall, MOMENT demonstrates the feasibility and value of large-scale, open, time-series foundation models for practical analysis under resource constraints.
Abstract
We introduce MOMENT, a family of open-source foundation models for general-purpose time series analysis. Pre-training large models on time series data is challenging due to (1) the absence of a large and cohesive public time series repository, and (2) diverse time series characteristics which make multi-dataset training onerous. Additionally, (3) experimental benchmarks to evaluate these models, especially in scenarios with limited resources, time, and supervision, are still in their nascent stages. To address these challenges, we compile a large and diverse collection of public time series, called the Time series Pile, and systematically tackle time series-specific challenges to unlock large-scale multi-dataset pre-training. Finally, we build on recent work to design a benchmark to evaluate time series foundation models on diverse tasks and datasets in limited supervision settings. Experiments on this benchmark demonstrate the effectiveness of our pre-trained models with minimal data and task-specific fine-tuning. Finally, we present several interesting empirical observations about large pre-trained time series models. Pre-trained models (AutonLab/MOMENT-1-large) and Time Series Pile (AutonLab/Timeseries-PILE) are available on Huggingface.
