A Review of Pseudo-Labeling for Computer Vision
Patrick Kage, Jay C. Rothenberger, Pavlos Andreadis, Dimitrios I. Diochnos
TL;DR
This survey unifies pseudo-labeling across semi-supervised, unsupervised, and self-supervised learning in computer vision by introducing a fuzzy-partition view of pseudo-labels. It catalogs a broad taxonomy of PL methods, including sample scheduling, weak supervision, consistency regularization, multi-model approaches, and knowledge distillation, and discusses how these ideas intersect through data filtering, curricula, and augmentation strategies. The paper highlights robustness to label noise and demonstrates how PL concepts permeate both SSL and UL methods, offering directions such as meta-learning for label assignment and self-supervised regularization to stabilize training. By synthesizing these threads, the work provides a framework for transferring advances across SSL, UL, and distillation, with practical implications for data-efficient learning in domains with scarce labels.
Abstract
Deep neural models have achieved state of the art performance on a wide range of problems in computer science, especially in computer vision. However, deep neural networks often require large datasets of labeled samples to generalize effectively, and an important area of active research is semi-supervised learning, which attempts to instead utilize large quantities of (easily acquired) unlabeled samples. One family of methods in this space is pseudo-labeling, a class of algorithms that use model outputs to assign labels to unlabeled samples which are then used as labeled samples during training. Such assigned labels, called pseudo-labels, are most commonly associated with the field of semi-supervised learning. In this work we explore a broader interpretation of pseudo-labels within both self-supervised and unsupervised methods. By drawing the connection between these areas we identify new directions when advancements in one area would likely benefit others, such as curriculum learning and self-supervised regularization.
