OTMatch: Improving Semi-Supervised Learning with Optimal Transport
Zhiquan Tan, Kaipeng Zheng, Weiran Huang
TL;DR
OTMatch tackles the overconfidence problem in pseudo-labeling for semi-supervised learning by introducing an optimal transport loss that encodes inter-class semantic relationships. By bootstrapping the transport cost from the model's learning dynamics and using a teacher–student EMA framework with dual augmentations, OTMatch aligns semantic distributions between predictions and pseudo-labels, improving robustness especially with scarce labeled data. Empirically, OTMatch delivers state-of-the-art or competitive gains on vision benchmarks (CIFAR-10/100, STL-10, ImageNet) and multilingual NLP datasets, with a low computational overhead of $O(K)$ for the OT term and effective integration with existing SSL methods. The approach advances semi-supervised learning by treating class relations as a learnable, data-driven regularizer and paves the way for broader use of OT-based semantic distribution matching in self-supervised and multi-modal settings.
Abstract
Semi-supervised learning has made remarkable strides by effectively utilizing a limited amount of labeled data while capitalizing on the abundant information present in unlabeled data. However, current algorithms often prioritize aligning image predictions with specific classes generated through self-training techniques, thereby neglecting the inherent relationships that exist within these classes. In this paper, we present a new approach called OTMatch, which leverages semantic relationships among classes by employing an optimal transport loss function to match distributions. We conduct experiments on many standard vision and language datasets. The empirical results show improvements in our method above baseline, this demonstrates the effectiveness and superiority of our approach in harnessing semantic relationships to enhance learning performance in a semi-supervised setting.
