Computer Vision Self-supervised Learning Methods on Time Series
Daesoo Lee, Erlend Aune
TL;DR
This study examines whether mainstream computer-vision SSL frameworks can transfer to time-series data, using the UCR and UEA benchmarks. It benchmarks contrastive and non-contrastive methods (e.g., SimCLR, BYOL, SimSiam, Barlow Twins, VICReg) and introduces VIbCReg, an improved VICReg variant with a normalized covariance matrix and IterNorm to accelerate learning. VIbCReg achieves state-of-the-art linear evaluation and competitive SVM performance on UCR/UEA, while using a lightweight 1D-ResNet encoder, demonstrating cross-domain applicability of CV SSL ideas to time series. The work highlights faster representation learning and stronger feature decorrelation, suggesting future work on longer sequences and more flexible encoders. Overall, VIbCReg offers a robust, transfer-friendly SSL approach for time-series representation learning with practical implications for industrial, financial, and IoT data analysis.
Abstract
Self-supervised learning (SSL) has had great success in both computer vision. Most of the current mainstream computer vision SSL frameworks are based on Siamese network architecture. These approaches often rely on cleverly crafted loss functions and training setups to avoid feature collapse. In this study, we evaluate if those computer-vision SSL frameworks are also effective on a different modality (\textit{i.e.,} time series). The effectiveness is experimented and evaluated on the UCR and UEA archives, and we show that the computer vision SSL frameworks can be effective even for time series. In addition, we propose a new method that improves on the recently proposed VICReg method. Our method improves on a \textit{covariance} term proposed in VICReg, and in addition we augment the head of the architecture by an iterative normalization layer that accelerates the convergence of the model.
