Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Yanxi Chen; Chunxiao Li; Xinyang Dai; Jinhuan Li; Weiyu Sun; Yiming Wang; Renyuan Zhang; Tinghe Zhang; Bo Wang

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Yanxi Chen, Chunxiao Li, Xinyang Dai, Jinhuan Li, Weiyu Sun, Yiming Wang, Renyuan Zhang, Tinghe Zhang, Bo Wang

TL;DR

This paper tackles SPML where each sample has only one verified positive label, addressing the resultant false negatives and label imbalance. It introduces Generalized Robust Loss (GR Loss), a framework built on expected risk minimization that combines soft pseudo-labeling via a calibrated $\hat{k}(p;\beta)$, instance/class-aware reweighting $v(p;\alpha)$, and three robust surrogate losses $\mathcal{L}_1, \mathcal{L}_2, \mathcal{L}_3$ to form a unified loss. The method unifies and extends existing SPML/MLML losses by mapping them into the GR framework, and it demonstrates state-of-the-art performance on VOC, COCO, and NUS benchmarks, with competitive results on CUB due to label-correlation effects. The work provides gradient analyses and ablations that highlight the importance of pseudo-label calibration and robust loss design for handling missing labels and intra-/inter-class imbalance, offering practical gains for SPML applications while noting limitations and future directions such as dynamic loss parameters and explicit label correlations.

Abstract

Multi-label learning (MLL) requires comprehensive multi-semantic annotations that is hard to fully obtain, thus often resulting in missing labels scenarios. In this paper, we investigate Single Positive Multi-label Learning (SPML), where each image is associated with merely one positive label. Existing SPML methods only focus on designing losses using mechanisms such as hard pseudo-labeling and robust losses, mostly leading to unacceptable false negatives. To address this issue, we first propose a generalized loss framework based on expected risk minimization to provide soft pseudo labels, and point out that the former losses can be seamlessly converted into our framework. In particular, we design a novel robust loss based on our framework, which enjoys flexible coordination between false positives and false negatives, and can additionally deal with the imbalance between positive and negative samples. Extensive experiments show that our approach can significantly improve SPML performance and outperform the vast majority of state-of-the-art methods on all the four benchmarks.

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

TL;DR

, instance/class-aware reweighting

, and three robust surrogate losses

to form a unified loss. The method unifies and extends existing SPML/MLML losses by mapping them into the GR framework, and it demonstrates state-of-the-art performance on VOC, COCO, and NUS benchmarks, with competitive results on CUB due to label-correlation effects. The work provides gradient analyses and ablations that highlight the importance of pseudo-label calibration and robust loss design for handling missing labels and intra-/inter-class imbalance, offering practical gains for SPML applications while noting limitations and future directions such as dynamic loss parameters and explicit label correlations.

Abstract

Paper Structure (49 sections, 34 equations, 5 figures, 6 tables)

This paper contains 49 sections, 34 equations, 5 figures, 6 tables.

Introduction
Related Work
Method
Problem Definition
Expected Risk Estimation
Novel SPML Loss Framework
Pseudo-labeling.
Loss reweighting.
The analysis and formulation of $\hat{k}(p;\beta)$
The formulation.
The update of $\beta$.
The analysis and formulation of $v(p;\alpha)$
The formulation.
The update of $\alpha$.
The analysis and formulation of $\mathcal{L}_1,\mathcal{L}_2,\mathcal{L}_3$
...and 34 more sections

Figures (5)

Figure 1: The gradient analysis. Better viewed in color.
Figure 2: The mAP (%) of GR Loss with different hyperparameters on VOC. Better viewed on screen and in color.
Figure 3: Distinguishing ability of model predictions on VOC. Better viewed on screen and in color.
Figure 4: The numbers of different samples w.r.t. the class. The number of positive labels (i.e., pos), the number of positive labels among the missing labels, i.e., pos (missing), and total number of samples represented by a dashed line for each class in four datasets.
Figure 5: The average values of FN/(FN+TN) across 100 intervals on the VOC and COCO. (a) represents the beginning of training on VOC, (b) is the beginning of training on COCO, (c) illustrates the end of training along with $\hat{k}(p)$ curve (the blue one) in Eq.\ref{['kx']}, where $a\!=\!1/1.46$, and (d) displays the end of training along with $\hat{k}(p)$ curve (the blue one) in Eq.\ref{['kx']}, where $a\!=\!1/2.94$.

Theorems & Definitions (2)

Remark 1
Remark 2

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

TL;DR

Abstract

Boosting Single Positive Multi-label Classification with Generalized Robust Loss

Authors

TL;DR

Abstract

Table of Contents

Figures (5)

Theorems & Definitions (2)