Denoising-Aware Contrastive Learning for Noisy Time Series
Shuang Zhou, Daochen Zha, Xiao Shen, Xiao Huang, Rui Zhang, Fu-Lai Chung
TL;DR
The paper tackles the challenge of noisy time-series data limiting self-supervised learning (SSL) performance. It introduces DEnoising-aware Contrastive Learning (DECL), an end-to-end framework that entwines denoising with contrastive SSL via a denoiser-driven objective and an automatic per-sample denoiser selection mechanism, guided by an auto-regressive Transformer-based encoder. DECL jointly optimizes an auto-regressive loss and a denoiser-weighted contrastive loss, where per-sample denoisers are selected using reconstruction errors and regularization to avoid overfitting. Across five noisy datasets, DECL yields consistent improvements in linear and fine-tuned evaluations, demonstrates robustness to varying noise levels, and offers transferable representations, with code released publicly.
Abstract
Time series self-supervised learning (SSL) aims to exploit unlabeled data for pre-training to mitigate the reliance on labels. Despite the great success in recent years, there is limited discussion on the potential noise in the time series, which can severely impair the performance of existing SSL methods. To mitigate the noise, the de facto strategy is to apply conventional denoising methods before model training. However, this pre-processing approach may not fully eliminate the effect of noise in SSL for two reasons: (i) the diverse types of noise in time series make it difficult to automatically determine suitable denoising methods; (ii) noise can be amplified after mapping raw data into latent space. In this paper, we propose denoising-aware contrastive learning (DECL), which uses contrastive learning objectives to mitigate the noise in the representation and automatically selects suitable denoising methods for every sample. Extensive experiments on various datasets verify the effectiveness of our method. The code is open-sourced.
