Flexible Distribution Alignment: Towards Long-tailed Semi-supervised Learning with Proper Calibration
Emanuel Sanchez Aimar, Nathaniel Helgesen, Yonghao Xu, Marco Kuhlmann, Michael Felsberg
TL;DR
This work tackles long-tailed semi-supervised learning under label shift by introducing ADELLO, a framework that combines Flexible Distribution Alignment (FlexDA) with Complementary Consistency Regularization (CCR). FlexDA dynamically aligns the classifier to the unknown unlabeled data distribution using a time-smoothed target prior ${\hat Q}_{\alpha_t}$ and logit-adjusted losses that evolve toward a balanced prior, improving data utilization and debiasing during training. CCR exploits low-confidence pseudo-labels via masked distillation at a controlled temperature, enabling broader data usage and mitigating confirmation bias. Across CIFAR-LT, STL10-LT, and ImageNet127, ADELLO delivers state-of-the-art LTSSL performance and substantially better calibration (lower ECE/MCE), demonstrating robustness to various label-shift scenarios and practical impact for scalable, well-calibrated semi-supervised learning. The method preserves computational efficiency by not adding forward passes or extra classifiers, making it appealing for real-world LTSSL deployments.
Abstract
Long-tailed semi-supervised learning (LTSSL) represents a practical scenario for semi-supervised applications, challenged by skewed labeled distributions that bias classifiers. This problem is often aggravated by discrepancies between labeled and unlabeled class distributions, leading to biased pseudo-labels, neglect of rare classes, and poorly calibrated probabilities. To address these issues, we introduce Flexible Distribution Alignment (FlexDA), a novel adaptive logit-adjusted loss framework designed to dynamically estimate and align predictions with the actual distribution of unlabeled data and achieve a balanced classifier by the end of training. FlexDA is further enhanced by a distillation-based consistency loss, promoting fair data usage across classes and effectively leveraging underconfident samples. This method, encapsulated in ADELLO (Align and Distill Everything All at Once), proves robust against label shift, significantly improves model calibration in LTSSL contexts, and surpasses previous state-of-of-art approaches across multiple benchmarks, including CIFAR100-LT, STL10-LT, and ImageNet127, addressing class imbalance challenges in semi-supervised learning. Our code is available at https://github.com/emasa/ADELLO-LTSSL.
