Less is More: Unlocking Specialization of Time Series Foundation Models via Structured Pruning
Lifan Zhao, Yanyan Shen, Zhaoyang Liu, Xue Wang, Jiaji Deng
TL;DR
The study exposes inherent sparsity and task-specific substructures in Time Series Foundation Models and proposes a prune-then-finetune pipeline to preserve architectural priors while specializing TSFMs for downstream forecasting. By defining pruning units as input/output channels and employing loss-guided importance scores, the method progressively removes redundant components before fine-tuning, yielding improved accuracy and up to 7x faster inference. Across seven TSFMs and six benchmarks, the approach often outperforms full fine-tuning and strong specialized baselines, with notable zero-shot transfer benefits within related domains. This work highlights architectural specialization as a practical route to unlock TSFMs' potential in real-world forecasting tasks.
Abstract
Scaling laws motivate the development of Time Series Foundation Models (TSFMs) that pre-train vast parameters and achieve remarkable zero-shot forecasting performance. Surprisingly, even after fine-tuning, TSFMs cannot consistently outperform smaller, specialized models trained on full-shot downstream data. A key question is how to realize effective adaptation of TSFMs for a target forecasting task. Through empirical studies on various TSFMs, the pre-trained models often exhibit inherent sparsity and redundancy in computation, suggesting that TSFMs have learned to activate task-relevant network substructures to accommodate diverse forecasting tasks. To preserve this valuable prior knowledge, we propose a structured pruning method to regularize the subsequent fine-tuning process by focusing it on a more relevant and compact parameter space. Extensive experiments on seven TSFMs and six benchmarks demonstrate that fine-tuning a smaller, pruned TSFM significantly improves forecasting performance compared to fine-tuning original models. This prune-then-finetune paradigm often enables TSFMs to achieve state-of-the-art performance and surpass strong specialized baselines. Source code is made publicly available at https://github.com/SJTU-DMTai/Prune-then-Finetune.
