Label-efficient Time Series Representation Learning: A Review
Emadeldeen Eldele, Mohamed Ragab, Zhenghua Chen, Min Wu, Chee-Keong Kwoh, Xiaoli Li
TL;DR
This survey tackles the challenge of learning effective time series representations with limited labeled data. It introduces a taxonomy that separates in-domain from cross-domain strategies, and surveys data augmentation, self-supervised learning, semi-supervised learning, and domain adaptation within these categories. The work highlights concrete methods, their losses, and representative techniques (e.g., InfoNCE, MMD, adversarial domain adaptation), analyzes their advantages and limitations, and advocates hybrid approaches and standardized benchmarks. The findings underscore the practical importance of label-efficient methods for domains like healthcare and industry, where labeled data are scarce but unlabeled data abound, and they provide a roadmap for future research in hybrid strategies and domain-transfer-aware design.
Abstract
Label-efficient time series representation learning, which aims to learn effective representations with limited labeled data, is crucial for deploying deep learning models in real-world applications. To address the scarcity of labeled time series data, various strategies, e.g., transfer learning, self-supervised learning, and semi-supervised learning, have been developed. In this survey, we introduce a novel taxonomy for the first time, categorizing existing approaches as in-domain or cross-domain, based on their reliance on external data sources or not. Furthermore, we present a review of the recent advances in each strategy, conclude the limitations of current methodologies, and suggest future research directions that promise further improvements in the field.
