Table of Contents
Fetching ...

R-Tuning: Wavelet-Decomposed Replay and Semantic Alignment for Continual Adaptation of Pretrained Time-Series Models

Tianyi Yin, Jingwei Wang, Chenze Wang, Han Wang, Jiexuan Cai, Min Liu, Yunlong Ma, Kun Gao, Yuting Song, Weiming Shen

TL;DR

R-Tuning tackles the problem of continual adaptation for pre-trained time-series forecasting models under limited data access by introducing two core components: Wavelet-guided Replay, which generates frequency-aware, diverse replay samples from a frozen old model, and Semantic Alignment via latent distillation, which preserves prior knowledge in a compact latent space. The method demonstrates substantial improvements on new tasks (up to 46% reductions in MAE/MSE) while maintaining or slightly improving performance on old tasks (up to ~6% gains), and remains effective in few-shot regimes with as little as 4–5% synthetic data. These results indicate a practical, data-efficient strategy for deploying and updating pre-trained time-series models in dynamic environments, with potential for online and multi-modal extensions.

Abstract

Pre-trained models have demonstrated exceptional generalization capabilities in time-series forecasting; however, adapting them to evolving data distributions remains a significant challenge. A key hurdle lies in accessing the original training data, as fine-tuning solely on new data often leads to catastrophic forgetting. To address this issue, we propose Replay Tuning (R-Tuning), a novel framework designed for the continual adaptation of pre-trained time-series models. R-Tuning constructs a unified latent space that captures both prior and current task knowledge through a frequency-aware replay strategy. Specifically, it augments model-generated samples via wavelet-based decomposition across multiple frequency bands, generating trend-preserving and fusion-enhanced variants to improve representation diversity and replay efficiency. To further reduce reliance on synthetic samples, R-Tuning introduces a latent consistency constraint that aligns new representations with the prior task space. This constraint guides joint optimization within a compact and semantically coherent latent space, ensuring robust knowledge retention and adaptation. Extensive experimental results demonstrate the superiority of R-Tuning, which reduces MAE and MSE by up to 46.9% and 46.8%, respectively, on new tasks, while preserving prior knowledge with gains of up to 5.7% and 6.0% on old tasks. Notably, under few-shot settings, R-Tuning outperforms all state-of-the-art baselines even when synthetic proxy samples account for only 5% of the new task dataset.

R-Tuning: Wavelet-Decomposed Replay and Semantic Alignment for Continual Adaptation of Pretrained Time-Series Models

TL;DR

R-Tuning tackles the problem of continual adaptation for pre-trained time-series forecasting models under limited data access by introducing two core components: Wavelet-guided Replay, which generates frequency-aware, diverse replay samples from a frozen old model, and Semantic Alignment via latent distillation, which preserves prior knowledge in a compact latent space. The method demonstrates substantial improvements on new tasks (up to 46% reductions in MAE/MSE) while maintaining or slightly improving performance on old tasks (up to ~6% gains), and remains effective in few-shot regimes with as little as 4–5% synthetic data. These results indicate a practical, data-efficient strategy for deploying and updating pre-trained time-series models in dynamic environments, with potential for online and multi-modal extensions.

Abstract

Pre-trained models have demonstrated exceptional generalization capabilities in time-series forecasting; however, adapting them to evolving data distributions remains a significant challenge. A key hurdle lies in accessing the original training data, as fine-tuning solely on new data often leads to catastrophic forgetting. To address this issue, we propose Replay Tuning (R-Tuning), a novel framework designed for the continual adaptation of pre-trained time-series models. R-Tuning constructs a unified latent space that captures both prior and current task knowledge through a frequency-aware replay strategy. Specifically, it augments model-generated samples via wavelet-based decomposition across multiple frequency bands, generating trend-preserving and fusion-enhanced variants to improve representation diversity and replay efficiency. To further reduce reliance on synthetic samples, R-Tuning introduces a latent consistency constraint that aligns new representations with the prior task space. This constraint guides joint optimization within a compact and semantically coherent latent space, ensuring robust knowledge retention and adaptation. Extensive experimental results demonstrate the superiority of R-Tuning, which reduces MAE and MSE by up to 46.9% and 46.8%, respectively, on new tasks, while preserving prior knowledge with gains of up to 5.7% and 6.0% on old tasks. Notably, under few-shot settings, R-Tuning outperforms all state-of-the-art baselines even when synthetic proxy samples account for only 5% of the new task dataset.

Paper Structure

This paper contains 23 sections, 15 equations, 4 figures, 5 tables, 1 algorithm.

Figures (4)

  • Figure 1: Two types of adaptation methods. Rehearsal-based methods mitigate forgetting by training on synthesized samples and new data. Regularization-based methods constrain parameter updates within the shared feature space.
  • Figure 2: The architecture of R-Tuning. Random seeds are sampled from the Gaussian distribution as inputs to the old model. High-quality synthetic samples are generated via the Wavelet-guided Replay mechanism and combined with the new task dataset for fine-tuning. The Semantic Alignment mechanism ensures a performance balance between the new and old tasks.
  • Figure 3: Decompose the signal using Redundant Wavelet Transform. Freq denotes frequency.
  • Figure 4: The performance under different proportions of the synthetic replay sample.