Multi-Scale Finetuning for Encoder-based Time Series Foundation Models

Zhongzheng Qiao; Chenghao Liu; Yiming Zhang; Ming Jin; Quang Pham; Qingsong Wen; P. N. Suganthan; Xudong Jiang; Savitha Ramasamy

Multi-Scale Finetuning for Encoder-based Time Series Foundation Models

Zhongzheng Qiao, Chenghao Liu, Yiming Zhang, Ming Jin, Quang Pham, Qingsong Wen, P. N. Suganthan, Xudong Jiang, Savitha Ramasamy

TL;DR

This work tackles the challenge of finetuning encoder-based Time Series Foundation Models (TSFMs) for downstream tasks. It introduces Multiscale Finetuning (MSFT), a causally informed framework that explicitly models multiple sampling scales via scale-specific adapters, decoupled within-scale and cross-scale dependencies, and learned multi-scale prediction mixing. By aligning finetuning with the interventional distribution $P(Y|do(X))$ through backdoor adjustment, MSFT mitigates confounding from scale and leverages pretrained multi-scale knowledge to boost forecasting accuracy. Empirically, MSFT consistently outperforms naive finetuning and PEFT baselines across long sequence and probabilistic forecasting tasks on multiple backbones (Moirai, Moment, UniTS), often surpassing state-of-the-art models trained from scratch while maintaining reasonable efficiency. This approach advances practical deployment of TSFMs by enabling scalable, robust fine-tuning that respects the inherent multi-scale nature of time series data.

Abstract

Time series foundation models (TSFMs) demonstrate impressive zero-shot performance for time series forecasting. However, an important yet underexplored challenge is how to effectively finetune TSFMs on specific downstream tasks. While naive finetuning can yield performance gains, we argue that it falls short of fully leveraging TSFMs' capabilities, often resulting in overfitting and suboptimal performance. Given the diverse temporal patterns across sampling scales and the inherent multi-scale forecasting capabilities of TSFMs, we adopt a causal perspective to analyze finetuning process, through which we highlight the critical importance of explicitly modeling multiple scales and reveal the shortcomings of naive approaches. Focusing on encoder-based TSFMs, we propose Multiscale finetuning (MSFT), a simple yet general framework that explicitly integrates multi-scale modeling into the finetuning process. Experimental results on three different backbones (Moirai, Moment and Units) demonstrate that TSFMs finetuned with MSFT not only outperform naive and typical parameter efficient finetuning methods but also surpass state-of-the-art deep learning methods. Codes are available at https://github.com/zqiao11/MSFT.

Multi-Scale Finetuning for Encoder-based Time Series Foundation Models

TL;DR

Abstract

Multi-Scale Finetuning for Encoder-based Time Series Foundation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)