FATE: A Prompt-Tuning-Based Semi-Supervised Learning Framework for Extremely Limited Labeled Data
Hezhao Liu, Yang Lu, Mengke Li, Yiqun Zhang, Shreyank N Gowda, Chen Gong, Hanzi Wang
TL;DR
FATE targets semi-supervised learning under extreme label scarcity by decoupling adaptation and classification into a two-stage prompt-tuning process. It first learns Distribution-adaptive Prompts $P_d$ from unlabeled data to align the backbone with downstream distributions, then uses Classification Prompts $P_c$ in a refined SSL objective to leverage both labeled and unlabeled data for final classification. The framework is shown to be effective for both vision and vision-language models, delivering substantial gains over state-of-the-art SSL and PEFT methods across multiple benchmarks. This approach enables robust SSL performance with minimal labeled data, highlighting the value of prompt-tuning and distribution-aware adaptation in settings with scarce supervision.
Abstract
Semi-supervised learning (SSL) has achieved significant progress by leveraging both labeled data and unlabeled data. Existing SSL methods overlook a common real-world scenario when labeled data is extremely scarce, potentially as limited as a single labeled sample in the dataset. General SSL approaches struggle to train effectively from scratch under such constraints, while methods utilizing pre-trained models often fail to find an optimal balance between leveraging limited labeled data and abundant unlabeled data. To address this challenge, we propose Firstly Adapt, Then catEgorize (FATE), a novel SSL framework tailored for scenarios with extremely limited labeled data. At its core, the two-stage prompt tuning paradigm FATE exploits unlabeled data to compensate for scarce supervision signals, then transfers to downstream tasks. Concretely, FATE first adapts a pre-trained model to the feature distribution of downstream data using volumes of unlabeled samples in an unsupervised manner. It then applies an SSL method specifically designed for pre-trained models to complete the final classification task. FATE is designed to be compatible with both vision and vision-language pre-trained models. Extensive experiments demonstrate that FATE effectively mitigates challenges arising from the scarcity of labeled samples in SSL, achieving an average performance improvement of 33.74% across seven benchmarks compared to state-of-the-art SSL methods. Code is available at https://anonymous.4open.science/r/Semi-supervised-learning-BA72.
