Contrastive Learning Is Not Optimal for Quasiperiodic Time Series
Adrian Atienza, Jakob Bardram, Sadasivan Puthusserypady
TL;DR
The paper addresses the mismatch between contrastive self-supervised learning objectives and the needs of quasiperiodic time series, where within-record temporal dynamics are informative but often ignored when forcing cross-record distinctiveness. It proposes Distilled Embedding for Almost-Periodic Time Series (DEAPS), a non-contrastive SSL framework that disentangles static and dynamic patterns using dual projectors, a Gradual Loss $L_{gra}$ to encourage dynamic evolution, and selective optimization with a covariance regularizer. Empirically, DEAPS yields up to about 10% improvements on downstream tasks with few labeled records across AFib identification, gender classification, and Physionet 2017, and PCA analyses illustrate clearer disentanglement of static and dynamic components compared to contrastive methods. The approach offers a practical, label-efficient pathway for physiological time series analysis and suggests broader applicability to other quasiperiodic signals, though its pretraining is currently demonstrated on a single database (SHHS).
Abstract
Despite recent advancements in Self-Supervised Learning (SSL) for time series analysis, a noticeable gap persists between the anticipated achievements and actual performance. While these methods have demonstrated formidable generalization capabilities with minimal labels in various domains, their effectiveness in distinguishing between different classes based on a limited number of annotated records is notably lacking. Our hypothesis attributes this bottleneck to the prevalent use of Contrastive Learning, a shared training objective in previous state-of-the-art (SOTA) methods. By mandating distinctiveness between representations for negative pairs drawn from separate records, this approach compels the model to encode unique record-based patterns but simultaneously neglects changes occurring across the entire record. To overcome this challenge, we introduce Distilled Embedding for Almost-Periodic Time Series (DEAPS) in this paper, offering a non-contrastive method tailored for quasiperiodic time series, such as electrocardiogram (ECG) data. By avoiding the use of negative pairs, we not only mitigate the model's blindness to temporal changes but also enable the integration of a "Gradual Loss (Lgra)" function. This function guides the model to effectively capture dynamic patterns evolving throughout the record. The outcomes are promising, as DEAPS demonstrates a notable improvement of +10% over existing SOTA methods when just a few annotated records are presented to fit a Machine Learning (ML) model based on the learned representation.
