Table of Contents
Fetching ...

BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning

Hongwei Zheng, Linyuan Zhou, Han Li, Jinming Su, Xiaoming Wei, Xiaoming Xu

TL;DR

The paper addresses LTSSL by tackling both data quantity imbalance and class-wise uncertainty. It proposes Balanced and Entropy-based Mix (BEM), which combines CamMix for localized data mixing, a Class Balanced Mix Bank (CBMB) driven by the effective number $E_c$, and an entropy-based learning (EL) module to balance per-class uncertainty through entropy-based sampling, masking, and a class-balanced loss. Key contributions include the CAM-based mixing region (CamMix), a principled sampling scheme leveraging $E_c$ and EMA-estimated class distributions, and an entropy-integrated training objective that jointly accounts for data quantity and uncertainty. Empirically, BEM consistently improves LTSSL baselines across CIFAR10/100-LT, STL10-LT, and ImageNet-127, achieving state-of-the-art results and proving its versatility as a complementary component to existing re-balancing methods.

Abstract

Data mixing methods play a crucial role in semi-supervised learning (SSL), but their application is unexplored in long-tailed semi-supervised learning (LTSSL). The primary reason is that the in-batch mixing manner fails to address class imbalance. Furthermore, existing LTSSL methods mainly focus on re-balancing data quantity but ignore class-wise uncertainty, which is also vital for class balance. For instance, some classes with sufficient samples might still exhibit high uncertainty due to indistinguishable features. To this end, this paper introduces the Balanced and Entropy-based Mix (BEM), a pioneering mixing approach to re-balance the class distribution of both data quantity and uncertainty. Specifically, we first propose a class balanced mix bank to store data of each class for mixing. This bank samples data based on the estimated quantity distribution, thus re-balancing data quantity. Then, we present an entropy-based learning approach to re-balance class-wise uncertainty, including entropy-based sampling strategy, entropy-based selection module, and entropy-based class balanced loss. Our BEM first leverages data mixing for improving LTSSL, and it can also serve as a complement to the existing re-balancing methods. Experimental results show that BEM significantly enhances various LTSSL frameworks and achieves state-of-the-art performances across multiple benchmarks.

BEM: Balanced and Entropy-based Mix for Long-Tailed Semi-Supervised Learning

TL;DR

The paper addresses LTSSL by tackling both data quantity imbalance and class-wise uncertainty. It proposes Balanced and Entropy-based Mix (BEM), which combines CamMix for localized data mixing, a Class Balanced Mix Bank (CBMB) driven by the effective number , and an entropy-based learning (EL) module to balance per-class uncertainty through entropy-based sampling, masking, and a class-balanced loss. Key contributions include the CAM-based mixing region (CamMix), a principled sampling scheme leveraging and EMA-estimated class distributions, and an entropy-integrated training objective that jointly accounts for data quantity and uncertainty. Empirically, BEM consistently improves LTSSL baselines across CIFAR10/100-LT, STL10-LT, and ImageNet-127, achieving state-of-the-art results and proving its versatility as a complementary component to existing re-balancing methods.

Abstract

Data mixing methods play a crucial role in semi-supervised learning (SSL), but their application is unexplored in long-tailed semi-supervised learning (LTSSL). The primary reason is that the in-batch mixing manner fails to address class imbalance. Furthermore, existing LTSSL methods mainly focus on re-balancing data quantity but ignore class-wise uncertainty, which is also vital for class balance. For instance, some classes with sufficient samples might still exhibit high uncertainty due to indistinguishable features. To this end, this paper introduces the Balanced and Entropy-based Mix (BEM), a pioneering mixing approach to re-balance the class distribution of both data quantity and uncertainty. Specifically, we first propose a class balanced mix bank to store data of each class for mixing. This bank samples data based on the estimated quantity distribution, thus re-balancing data quantity. Then, we present an entropy-based learning approach to re-balance class-wise uncertainty, including entropy-based sampling strategy, entropy-based selection module, and entropy-based class balanced loss. Our BEM first leverages data mixing for improving LTSSL, and it can also serve as a complement to the existing re-balancing methods. Experimental results show that BEM significantly enhances various LTSSL frameworks and achieves state-of-the-art performances across multiple benchmarks.
Paper Structure (19 sections, 12 equations, 11 figures, 17 tables, 2 algorithms)

This paper contains 19 sections, 12 equations, 11 figures, 17 tables, 2 algorithms.

Figures (11)

  • Figure 1: Experimental results on CIFAR10-LT krizhevsky2009learning. (a)-(c): Class distribution of unlabeled data quantity and entropy for three typical settings, which have the same labeled data quantity distribution but differ in unlabeled ones. Both the data quantity and entropy are the statistical averages within one epoch after model convergence. Unexpected discrepancies are observed across all settings between the distribution of data quantity and entropy, particularly for head and tail classes. Notably, classes 3-6 exhibit the highest entropy, indicating greater uncertainty. (d): Test accuracy gain brought by BEM for various LTSSL frameworks in consistent setting.
  • Figure 2: Left: The overview of Balanced and Entropy-based Mixing (BEM), incorporating with FixMatch sohn2020fixmatch as an example in this figure. BEM consists of two sub-modules: class balanced mix bank (CBMB) and entropy-based learning (EL). CBMB re-balances data quantity through the proposed CamMix, guided by a class-balanced sampling function. EL further re-balances class-wise uncertainty using three techniques: entropy-based sampling strategy (ESS), entropy-based selection module (ESM) and entropy-based class balanced loss ($L_{ecb}$). Right: The sampling and CamMix process of BEM. The sampling process considers both the class distribution of data quantity and uncertainty, which are estimated on the fly. CamMix extracts the bounding box from the high response area of the CAM to form mixed data. (The lock icon denotes the unknown distribution that needs estimation, and the $\oplus$ icon denotes the process of CamMix.).
  • Figure 3: Class distribution of data quantity and entropy in three settings. Each mixed data is calculated as containing two classes.
  • Figure 4: Comparison of t-SNE visualization with combinations of FixMatch and ACR.
  • Figure 5: The visualization of data mixing process for CutMix, SaliencyMix, and CamMix on STL10-LT. The red box indicates the image area selected by data mixing.
  • ...and 6 more figures