Can Class-Priors Help Single-Positive Multi-Label Learning?
Biao Liu, Ning Xu, Jie Wang, Xin Geng
TL;DR
This work tackles single-positive multi-label learning by relaxing the common equal-class-prior assumption. It introduces Crisp, a framework that (i) estimates per-class priors from a black-box classifier via a threshold-based mechanism that converges to the true priors, and (ii) derives an unbiased, priors-guided risk estimator with learning guarantees that approximate the fully supervised optimum. The approach yields theoretical convergence bounds and demonstrates superior performance across ten benchmark datasets, with robust priors prediction and favorable training efficiency. Overall, Crisp provides a principled, practical solution for SPMLL that mitigates bias from imbalanced class priors and improves multi-label prediction quality.
Abstract
Single-positive multi-label learning (SPMLL) is a typical weakly supervised multi-label learning problem, where each training example is annotated with only one positive label. Existing SPMLL methods typically assign pseudo-labels to unannotated labels with the assumption that prior probabilities of all classes are identical. However, the class-prior of each category may differ significantly in real-world scenarios, which makes the predictive model not perform as well as expected due to the unrealistic assumption on real-world application. To alleviate this issue, a novel framework named {\proposed}, i.e., Class-pRiors Induced Single-Positive multi-label learning, is proposed. Specifically, a class-priors estimator is introduced, which could estimate the class-priors that are theoretically guaranteed to converge to the ground-truth class-priors. In addition, based on the estimated class-priors, an unbiased risk estimator for classification is derived, and the corresponding risk minimizer could be guaranteed to approximately converge to the optimal risk minimizer on fully supervised data. Experimental results on ten MLL benchmark datasets demonstrate the effectiveness and superiority of our method over existing SPMLL approaches.
