Appeal: Allow Mislabeled Samples the Chance to be Rectified in Partial Label Learning
Chongjie Si, Xuehui Wang, Yan Wang, Xiaokang Yang, Wei Shen
TL;DR
This work addresses mislabelled samples in partial label learning by introducing PLCP, an appeal-based framework that couples a base PLL classifier with a partner classifier that leverages non-candidate label information. A blurring mechanism and a collaborative term enable mutual supervision, allowing mislabeled instances to be rectified while avoiding overconfidence. Kernel and deep learning extensions broaden applicability, and extensive experiments on real-world PLL data and CIFAR demonstrate consistent improvements over baselines and state-of-the-art PLL methods, highlighting practical gains in robustness and disambiguation. The proposed approach enhances the practical reliability of PLL systems in domains with ambiguous supervision and mislabeled data.
Abstract
In partial label learning (PLL), each instance is associated with a set of candidate labels among which only one is ground-truth. The majority of the existing works focuses on constructing robust classifiers to estimate the labeling confidence of candidate labels in order to identify the correct one. However, these methods usually struggle to identify and rectify mislabeled samples. To help these mislabeled samples "appeal" for themselves and help existing PLL methods identify and rectify mislabeled samples, in this paper, we propose the first appeal-based PLL framework. Specifically, we introduce a novel partner classifier and instantiate it predicated on the implicit fact that non-candidate labels of a sample should not be assigned to it, which is inherently accurate and has not been fully investigated in PLL. Furthermore, a novel collaborative term is formulated to link the base classifier and the partner one. During each stage of mutual supervision, both classifiers will blur each other's predictions through a blurring mechanism to prevent overconfidence in a specific label. Extensive experiments demonstrate that the appeal and disambiguation ability of several well-established stand-alone and deep-learning based PLL approaches can be significantly improved by coupling with this learning paradigm.
