A Channel-ensemble Approach: Unbiased and Low-variance Pseudo-labels is Critical for Semi-supervised Classification
Jiaqi Wu, Junbiao Pang, Baochang Zhang, Qingming Huang
TL;DR
This work tackles the bias and high-variance issues of pseudo-labels in semi-supervised classification under scarce labels. It proposes Channel-Based Ensemble (CBE), a lightweight, plug-and-play multi-head ensemble with two branches, complemented by a Chebyshev constraint and two loss terms—Low Bias Loss and Low Variance Loss—to produce unbiased, low-variance pseudo-labels and stabilize self-training. Empirical results on CIFAR-10 and CIFAR-100 show that integrating CBE with FixMatch or FreeMatch yields substantial accuracy gains with minimal computational overhead, and ablations confirm the effectiveness of the LB and LV components. The method provides a general, efficient pathway to improve SSL PL quality across frameworks, with practical impact for low-label regimes and scalable deployment.
Abstract
Semi-supervised learning (SSL) is a practical challenge in computer vision. Pseudo-label (PL) methods, e.g., FixMatch and FreeMatch, obtain the State Of The Art (SOTA) performances in SSL. These approaches employ a threshold-to-pseudo-label (T2L) process to generate PLs by truncating the confidence scores of unlabeled data predicted by the self-training method. However, self-trained models typically yield biased and high-variance predictions, especially in the scenarios when a little labeled data are supplied. To address this issue, we propose a lightweight channel-based ensemble method to effectively consolidate multiple inferior PLs into the theoretically guaranteed unbiased and low-variance one. Importantly, our approach can be readily extended to any SSL framework, such as FixMatch or FreeMatch. Experimental results demonstrate that our method significantly outperforms state-of-the-art techniques on CIFAR10/100 in terms of effectiveness and efficiency.
