Mixed Blessing: Class-Wise Embedding guided Instance-Dependent Partial Label Learning
Fuchao Yang, Jianhong Cheng, Hui Liu, Yongqiang Dong, Yuheng Jia, Junhui Hou
TL;DR
This work tackles instance-dependent partial label learning (IDPLL), where candidate labels are highly correlated with sample features, offering both informative supervision and high label ambiguity. It introduces a three-module framework that learns per-sample class-wise embeddings, enabling explicit modeling of relationships among class labels. Two losses—class associative loss $L_{cal}$ and prototype discriminative loss $L_{pdl}$—drive embeddings to cluster within candidate labels while aligning high-confidence predictions with global class prototypes. Training is staged: first optimize $\,\mathcal{L}_{cls} + \alpha \, \mathcal{L}_{cal}$, then incorporate $eta \, \mathcal{L}_{pdl}$ to form $\\mathcal{L}_{all} = \, \mathcal{L}_{cls} + \alpha \, \mathcal{L}_{cal} + \beta \, \mathcal{L}_{pdl}$. Experiments on six benchmarks show consistent improvements over twelve baselines, with particular gains on fine-grained datasets and faster early learning, and code is publicly available at https://github.com/Yangfc-ML/CEL.
Abstract
In partial label learning (PLL), every sample is associated with a candidate label set comprising the ground-truth label and several noisy labels. The conventional PLL assumes the noisy labels are randomly generated (instance-independent), while in practical scenarios, the noisy labels are always instance-dependent and are highly related to the sample features, leading to the instance-dependent partial label learning (IDPLL) problem. Instance-dependent noisy label is a double-edged sword. On one side, it may promote model training as the noisy labels can depict the sample to some extent. On the other side, it brings high label ambiguity as the noisy labels are quite undistinguishable from the ground-truth label. To leverage the nuances of IDPLL effectively, for the first time we create class-wise embeddings for each sample, which allow us to explore the relationship of instance-dependent noisy labels, i.e., the class-wise embeddings in the candidate label set should have high similarity, while the class-wise embeddings between the candidate label set and the non-candidate label set should have high dissimilarity. Moreover, to reduce the high label ambiguity, we introduce the concept of class prototypes containing global feature information to disambiguate the candidate label set. Extensive experimental comparisons with twelve methods on six benchmark data sets, including four fine-grained data sets, demonstrate the effectiveness of the proposed method. The code implementation is publicly available at https://github.com/Yangfc-ML/CEL.
