Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

Chak Fong Chong; Jielong Guo; Xu Yang; Wei Ke; Yapeng Wang

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

Chak Fong Chong, Jielong Guo, Xu Yang, Wei Ke, Yapeng Wang

TL;DR

This work tackles partially labeled multi-label image classification by extending Mixup through LogicMix, a data augmentation that uses logical OR to combine labels and supports mixing multiple samples. By handling unknown labels with logical equivalences and alternating augmented samples with originals, LogicMix delivers stronger regularization with minimal training overhead. When integrated into RandAugment, Curriculum Labeling, and Category-wise Fine-Tuning, the approach achieves state-of-the-art mean average precision on MS-COCO, VG-200, and Pascal VOC 2007 across varying known-label proportions. The results demonstrate LogicMix’s generality, compatibility with other techniques, and potential for broad practical impact in data-scarce settings.

Abstract

Multi-label image classification datasets are often partially labeled where many labels are missing, posing a significant challenge to training accurate deep classifiers. However, the powerful Mixup sample-mixing data augmentation cannot be well utilized to address this challenge, as it cannot perform linear interpolation on the unknown labels to construct augmented samples. In this paper, we propose LogicMix, a Mixup variant designed for such partially labeled datasets. LogicMix mixes the sample labels by logical OR so that the unknown labels can be correctly mixed by utilizing OR's logical equivalences, including the domination and identity laws. Unlike Mixup, which mixes exactly two samples, LogicMix can mix multiple ($\geq2$) partially labeled samples, constructing visually more confused augmented samples to regularize training. LogicMix is more general and effective than other compared Mixup variants in the experiments on various partially labeled dataset scenarios. Moreover, it is plug-and-play and only requires minimal computation, hence it can be easily inserted into existing frameworks to collaborate with other methods to improve model performance with a negligible impact on training time, as demonstrated through extensive experiments. In particular, through the collaboration of LogicMix, RandAugment, Curriculum Labeling, and Category-wise Fine-Tuning, we attain state-of-the-art performance on MS-COCO, VG-200, and Pascal VOC 2007 benchmarking datasets. The remarkable generality, effectiveness, collaboration, and simplicity suggest that LogicMix promises to be a popular and vital data augmentation method.

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

TL;DR

Abstract

) partially labeled samples, constructing visually more confused augmented samples to regularize training. LogicMix is more general and effective than other compared Mixup variants in the experiments on various partially labeled dataset scenarios. Moreover, it is plug-and-play and only requires minimal computation, hence it can be easily inserted into existing frameworks to collaborate with other methods to improve model performance with a negligible impact on training time, as demonstrated through extensive experiments. In particular, through the collaboration of LogicMix, RandAugment, Curriculum Labeling, and Category-wise Fine-Tuning, we attain state-of-the-art performance on MS-COCO, VG-200, and Pascal VOC 2007 benchmarking datasets. The remarkable generality, effectiveness, collaboration, and simplicity suggest that LogicMix promises to be a popular and vital data augmentation method.

Paper Structure (48 sections, 14 equations, 4 figures, 9 tables)

This paper contains 48 sections, 14 equations, 4 figures, 9 tables.

Introduction
LogicMix
Mixing Partially Labeled Samples
Mixing Multiple ($\geq 2$) Samples
Construction of $\mathbf{x}'$
Construction of $\mathbf{y}'$
Alternating Between Augmented Samples and Original Samples
LogicMix Definition
LogicMix
Comparison to Mixup Variants for MLC
Impact to Training Time
Learning Framework with LogicMix
Data Augmentation with RandAugment and LogicMix
Curriculum Labeling
Category-wise Fine-Tuning
...and 33 more sections

Figures (4)

Figure 1: The overview of LogicMix.
Figure 2: Impact of LogicMix's hyperparameters to mAP% on MS-COCO. (Numerial results in Appendix \ref{['sec:hyparam_num']})
Figure 3: The overview of the framework with LogicMix.
Figure 4: The mean computation times per training epoch of using LogicMix with different number of data loader workers.

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

TL;DR

Abstract

Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification

Authors

TL;DR

Abstract

Table of Contents

Figures (4)