Co-Training with Active Contrastive Learning and Meta-Pseudo-Labeling on 2D Projections for Deep Semi-Supervised Learning
David Aparco-Cardenas, Jancarlo F. Gomes, Alexandre X. Falcão, Pedro J. de Rezende
TL;DR
This paper tackles the challenge of training deep CNNs with limited labeled data by introducing active-DeepFA, a two-network co-training framework that blends supervised contrastive learning, meta-pseudo-labeling, and active learning within a DeepFA-based paradigm. It leverages label propagation on 2D projections of deep features via OPFSemi and periodically expands the labeled set by querying an oracle for the most informative samples, thereby reducing annotation effort. The approach achieves state-of-the-art results on three challenging biological image datasets with only 5% labeled data and remains competitive with as little as 3% labeled, demonstrating strong data efficiency and generalization. The combination of dual-network cross-training, proactive sample labeling, and 2D feature-space propagation offers a practical route to high-performance SSL for non-pretrained architectures in domains with scarce annotations and abundant unlabeled data.
Abstract
A major challenge that prevents the training of DL models is the limited availability of accurately labeled data. This shortcoming is highlighted in areas where data annotation becomes a time-consuming and error-prone task. In this regard, SSL tackles this challenge by capitalizing on scarce labeled and abundant unlabeled data; however, SoTA methods typically depend on pre-trained features and large validation sets to learn effective representations for classification tasks. In addition, the reduced set of labeled data is often randomly sampled, neglecting the selection of more informative samples. Here, we present active-DeepFA, a method that effectively combines CL, teacher-student-based meta-pseudo-labeling and AL to train non-pretrained CNN architectures for image classification in scenarios of scarcity of labeled and abundance of unlabeled data. It integrates DeepFA into a co-training setup that implements two cooperative networks to mitigate confirmation bias from pseudo-labels. The method starts with a reduced set of labeled samples by warming up the networks with supervised CL. Afterward and at regular epoch intervals, label propagation is performed on the 2D projections of the networks' deep features. Next, the most reliable pseudo-labels are exchanged between networks in a cross-training fashion, while the most meaningful samples are annotated and added into the labeled set. The networks independently minimize an objective loss function comprising supervised contrastive, supervised and semi-supervised loss components, enhancing the representations towards image classification. Our approach is evaluated on three challenging biological image datasets using only 5% of labeled samples, improving baselines and outperforming six other SoTA methods. In addition, it reduces annotation effort by achieving comparable results to those of its counterparts with only 3% of labeled data.
