Parametric Augmentation for Time Series Contrastive Learning
Xu Zheng, Tianchun Wang, Wei Cheng, Aitian Ma, Haifeng Chen, Mo Sha, Dongsheng Luo
TL;DR
The paper addresses the challenge of designing effective augmentations for time series contrastive learning by introducing AutoTCL, a parametric augmentation framework that factorizes each instance into an informative component $x^*$ and a task-irrelevant part $\Delta x$, then learns a lossless, adaptive view via an invertible transform $g$ and a learnable mask $h$. A principled objective based on the Principle of Relevant Information (PRI) guides the augmentation network, balancing information preservation with view diversity, while a time-series encoder trains with both global and local contrastive losses. Empirical results on forecasting and classification demonstrate consistent improvements over strong baselines, with univariate forecasting improved by around 6.5% in MSE and 4.8% in MAE, and classification gains of about 1.2% in average accuracy. The approach is encoder-agnostic and shows robust benefits across multiple backbones and datasets, highlighting the practical impact of adaptive, factorization-based augmentations for time series representations.
Abstract
Modern techniques like contrastive learning have been effectively used in many areas, including computer vision, natural language processing, and graph-structured data. Creating positive examples that assist the model in learning robust and discriminative representations is a crucial stage in contrastive learning approaches. Usually, preset human intuition directs the selection of relevant data augmentations. Due to patterns that are easily recognized by humans, this rule of thumb works well in the vision and language domains. However, it is impractical to visually inspect the temporal structures in time series. The diversity of time series augmentations at both the dataset and instance levels makes it difficult to choose meaningful augmentations on the fly. In this study, we address this gap by analyzing time series data augmentation using information theory and summarizing the most commonly adopted augmentations in a unified format. We then propose a contrastive learning framework with parametric augmentation, AutoTCL, which can be adaptively employed to support time series representation learning. The proposed approach is encoder-agnostic, allowing it to be seamlessly integrated with different backbone encoders. Experiments on univariate forecasting tasks demonstrate the highly competitive results of our method, with an average 6.5\% reduction in MSE and 4.7\% in MAE over the leading baselines. In classification tasks, AutoTCL achieves a $1.2\%$ increase in average accuracy.
