An Overview of Deep Semi-Supervised Learning
Yassine Ouali, Céline Hudelot, Myriam Tami
TL;DR
This paper surveys the landscape of deep semi-supervised learning, addressing the challenge of obtaining large labeled datasets by leveraging unlabeled data. It categorizes dominant Deep SSL approaches into consistency regularization, proxy-label methods, generative models, graph-based SSL, and self-supervision, detailing representative techniques such as MixMatch, FixMatch, VAT, Ladder Networks, and various GAN/VAEs methods. The work highlights key assumptions (smoothness, cluster, manifold) and practical evaluation guidelines, including fair baselines and dataset considerations. By connecting theoretical premises with a wide array of algorithms, the paper provides a consolidated reference for researchers and practitioners aiming to deploy data-efficient deep learning systems across vision and beyond.
Abstract
Deep neural networks demonstrated their ability to provide remarkable performances on a wide range of supervised learning tasks (e.g., image classification) when trained on extensive collections of labeled data (e.g., ImageNet). However, creating such large datasets requires a considerable amount of resources, time, and effort. Such resources may not be available in many practical cases, limiting the adoption and the application of many deep learning methods. In a search for more data-efficient deep learning methods to overcome the need for large annotated datasets, there is a rising research interest in semi-supervised learning and its applications to deep neural networks to reduce the amount of labeled data required, by either developing novel methods or adopting existing semi-supervised learning frameworks for a deep learning setting. In this paper, we provide a comprehensive overview of deep semi-supervised learning, starting with an introduction to the field, followed by a summarization of the dominant semi-supervised approaches in deep learning.
