RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

Yifan Zhang; Jingqin Yang; Zhiquan Tan; Yang Yuan

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

Yifan Zhang, Jingqin Yang, Zhiquan Tan, Yang Yuan

TL;DR

RelationMatch introduces Matrix Cross-Entropy (MCE), a theoretically grounded loss that enforces in-batch relational consistency between weak and strong augmentations in semi-supervised learning. By representing predictions as density-like matrices and aligning their pairwise relations via MCE, the approach extends traditional cross-entropy with a principled, geometry-aware objective. Empirically, it yields strong gains across CIFAR-10/100 and STL-10, including notable improvements over state-of-the-art methods and compatibility with Curriculum Pseudo Labeling (CPL). The framework rests on solid connections to density matrices, von Neumann entropy, and information geometry, offering a versatile foundation for incorporating relational cues in SSL and beyond.

Abstract

Semi-supervised learning has emerged as a pivotal approach for leveraging scarce labeled data alongside abundant unlabeled data. Despite significant progress, prevailing SSL methods predominantly enforce consistency between different augmented views of individual samples, thereby overlooking the rich relational structure inherent within a mini-batch. In this paper, we present RelationMatch, a novel SSL framework that explicitly enforces in-batch relational consistency through a Matrix Cross-Entropy (MCE) loss function. The proposed MCE loss is rigorously derived from both matrix analysis and information geometry perspectives, ensuring theoretical soundness and practical efficacy. Extensive empirical evaluations on standard benchmarks, including a notable 15.21% accuracy improvement over FlexMatch on STL-10, demonstrate that RelationMatch not only advances state-of-the-art performance but also provides a principled foundation for incorporating relational cues in SSL.

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

TL;DR

Abstract

Paper Structure (27 sections, 17 theorems, 54 equations, 1 figure, 3 tables)

This paper contains 27 sections, 17 theorems, 54 equations, 1 figure, 3 tables.

Introduction
Matrix Cross-Entropy for Supervised and Semi-supervised Learning
Warm-up Example
Matrix Cross-Entropy
Matrix Cross-Entropy: Foundations and Interpretations
Density Matrices and the Matrix Logarithm
Von Neumann Entropy and Matrix Cross-Entropy
Information Geometrical Perspective of Matrix Cross-Entropy
Unveiling the Properties of Matrix Cross-Entropy
The Scalar Cross-Entropy as a Special Case of MCE
Desirable Properties of MCE
RelationMatch: Applying MCE for Semi-supervised Learning
Datasets
Experimental Details
Supervised Learning Results
...and 12 more sections

Key Result

Theorem 3.3

Let $\mathbf{A} \in \mathbb{C}^{n \times n}$ be a matrix with no eigenvalues on $\mathbb{R}^{-}$. Then there exists a unique matrix $\mathbf{X}$ such that with all eigenvalues of $\mathbf{X}$ lying in the strip $\{ z \in \mathbb{C} : -\pi < \operatorname{Im}(z) < \pi \}$. We refer to $\mathbf{X}$ as the principal logarithm of $\mathbf{A}$ and denote it by $\mathbf{X} = \log(\mathbf{A})$. If $\mat

Figures (1)

Figure 1: Pseudo-labels are generated by inputting a batch of weakly augmented images into the model. Subsequently, the model computes the probability distributions for the corresponding strongly augmented images. The loss function seamlessly integrates both the traditional cross-entropy loss and the proposed matrix cross-entropy loss.

Theorems & Definitions (23)

Definition 2.1: Matrix Cross-Entropy for Positive Semi-Definite Matrices
Definition 3.1: Density Matrix on $\mathbb{R}^{n \times n}$
Definition 3.2: Matrix Logarithm
Theorem 3.3: Principal Matrix Logarithm higham2008functions
Proposition 3.4
Lemma 3.5
Definition 3.6: Matrix Relative Entropy for Density Matrices
Theorem 3.7: Projection Theorem amari2014information
Proposition 3.8: Minimization Property
Lemma 4.1
...and 13 more

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

TL;DR

Abstract

RelationMatch: Matching In-batch Relationships for Semi-supervised Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (23)