Rethinking Consistent Multi-Label Classification under Inexact Supervision
Wei Wang, Tianhao Ma, Ming-Kun Xie, Gang Niu, Masashi Sugiyama
TL;DR
The paper tackles multi-label classification under inexact supervision, introducing COMES, a unified framework that yields unbiased risk estimators for both first-order (Hamming) and second-order (ranking) objectives without relying on estimating the label-generation process or enforcing a uniform distribution. It provides theoretical guarantees of consistency and convergence rates, augmented by risk-correction mechanisms to improve generalization. Empirical results show that COMES-HL and COMES-RL consistently outperform state-of-the-art baselines on real-world and synthetic datasets, validating robustness to label imbalances and inexact supervision. The work advances practical weakly supervised MLC by modeling a realistic data-generation process and delivering provable, scalable learning algorithms.
Abstract
Partial multi-label learning and complementary multi-label learning are two popular weakly supervised multi-label classification paradigms that aim to alleviate the high annotation costs of collecting precisely annotated multi-label data. In partial multi-label learning, each instance is annotated with a candidate label set, among which only some labels are relevant; in complementary multi-label learning, each instance is annotated with complementary labels indicating the classes to which the instance does not belong. Existing consistent approaches for the two paradigms either require accurate estimation of the generation process of candidate or complementary labels or assume a uniform distribution to eliminate the estimation problem. However, both conditions are usually difficult to satisfy in real-world scenarios. In this paper, we propose consistent approaches that do not rely on the aforementioned conditions to handle both problems in a unified way. Specifically, we propose two unbiased risk estimators based on first- and second-order strategies. Theoretically, we prove consistency w.r.t. two widely used multi-label classification evaluation metrics and derive convergence rates for the estimation errors of the proposed risk estimators. Empirically, extensive experimental results validate the effectiveness of our proposed approaches against state-of-the-art methods.
