Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

Yu Wang; Yuxuan Yin; Peng Li

Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

Yu Wang, Yuxuan Yin, Peng Li

TL;DR

The paper addresses confirmation bias in semi-supervised learning by identifying that both the generation and utilization of pseudo labels contribute to biased learning. It introduces TaMatch, which computes a per-class scaling factor $\mathbf{r} = \frac{\mathbf{p}^{target}}{\mathbf{p}^{model}}$ and debiases weak-predictions via $RE(\mathbf{p}^w) = \text{Normalize}(\mathbf{p}^w \odot \mathbf{r})$, producing debiased pseudo labels. In the utilization phase, instances are weighted by $w = r(\arg\max(RE(\mathbf{p}^w)))$, and the target distribution $\mathbf{p}^{target}$ is dynamically updated with EMA, $\mathbf{p}_t^{target} = \lambda^{target} \mathbf{p}_{t-1}^{target} + (1-\lambda^{target}) \mathbf{p}_t^{model}$, with $\mathbf{p}_t^{model}$ as the EMA of batch predictions; an adaptive weight bound further stabilizes training. Empirical results on balanced and long-tailed image classification tasks show TaMatch outperforming state-of-the-art SSL baselines, demonstrating the importance of jointly debiasing the generation and utilization of pseudo labels for robust SSL.

Abstract

Semi-supervised learning (SSL) commonly exhibits confirmation bias, where models disproportionately favor certain classes, leading to errors in predicted pseudo labels that accumulate under a self-training paradigm. Unlike supervised settings, which benefit from a rich, static data distribution, SSL inherently lacks mechanisms to correct this self-reinforced bias, necessitating debiased interventions at each training step. Although the generation of debiased pseudo labels has been extensively studied, their effective utilization remains underexplored. Our analysis indicates that data from biased classes should have a reduced influence on parameter updates, while more attention should be given to underrepresented classes. To address these challenges, we introduce TaMatch, a unified framework for debiased training in SSL. TaMatch employs a scaling ratio derived from both a prior target distribution and the model's learning status to estimate and correct bias at each training step. This ratio adjusts the raw predictions on unlabeled data to produce debiased pseudo labels. In the utilization phase, these labels are differently weighted according to their predicted class, enhancing training equity and minimizing class bias. Additionally, TaMatch dynamically adjust the target distribution in response to the model's learning progress, facilitating robust handling of practical scenarios where the prior distribution is unknown. Empirical evaluations show that TaMatch significantly outperforms existing state-of-the-art methods across a range of challenging image classification tasks, highlighting the critical importance of both the debiased generation and utilization of pseudo labels in SSL.

Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

TL;DR

and debiases weak-predictions via

, producing debiased pseudo labels. In the utilization phase, instances are weighted by

, and the target distribution

is dynamically updated with EMA,

, with

as the EMA of batch predictions; an adaptive weight bound further stabilizes training. Empirical results on balanced and long-tailed image classification tasks show TaMatch outperforming state-of-the-art SSL baselines, demonstrating the importance of jointly debiasing the generation and utilization of pseudo labels for robust SSL.

Abstract

Paper Structure (38 sections, 18 equations, 4 figures, 7 tables)

This paper contains 38 sections, 18 equations, 4 figures, 7 tables.

Introduction
Preliminaries
SSL with Hard Pseudo Labeling and Consistency Regularization
Confirmation Bias in SSL
Definition:
Bias Amplification in SSL:
Motivating Example:
Methodology
Overview of TaMatch
Calculation of the Scaling Factor:
Debiased Generation of Pseudo Labels:
Debiased Utilization of Pseudo Labels:
Dynamically Updated Target Distribution
Adaptive Weight Bound for Robust Training
Experiments
...and 23 more sections

Figures (4)

Figure 1: Probability of bias amplification with different initial $p_{1}$. $n$ is the number of samples drawn in each step.
Figure 2: Overview of the TaMatch framework with both debiased generation and utilization of pseudo labels.
Figure 3: Comparison between TaMatch and Fixmat on STL-10 with 40 labeled data. (a) Per-class accuracy; (b) KL divergence between $\bm{p}^{\text{model}}$ and $\bm{p}^{\text{truth}}$; (c) Utilization ratio of unlabeled data.
Figure 4: Error rate during training of TaMatch on the imbalanced CIFAR-10-LT with $N_l^{(1)}=500$, $N_u^{(1)}=4000$, and $\gamma=150$.

Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

TL;DR

Abstract

Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

Authors

TL;DR

Abstract

Table of Contents

Figures (4)