Forming Auxiliary High-confident Instance-level Loss to Promote Learning from Label Proportions

Tianhao Ma; Han Chen; Juncheng Hu; Yungang Zhu; Ximing Li

TL;DR

This work addresses a common failure mode in learning from label proportions (LLP): pseudo-labels that become inaccurate and over-smoothed when bags are large. It introduces L^2P-AHIL, which couples a bag-level LLP loss with a high-confident instance-level loss through a dual entropy-based weight (DEW) that combines bag-level and instance-level entropies to gauge pseudo-label confidence. The method yields state-of-the-art results across multiple benchmarks, with notable gains as bag size increases, and the results indicate that adaptive weighting fosters more discriminative representations. The approach gives weakly-supervised learning a principled, entropy-driven mechanism for selectively using pseudo-labels, which is useful in settings where instance-level labels are costly or unavailable.

Abstract

Learning from label proportions (LLP), a challenging weakly-supervised learning task, aims to train a classifier using bags of instances and the proportions of classes within each bag, rather than annotated labels for each instance. Beyond the traditional bag-level loss, the mainstream methodology of LLP is to incorporate an auxiliary instance-level loss with pseudo-labels formed from predictions. Unfortunately, we empirically observed that the pseudo-labels are often inaccurate due to over-smoothing, especially in scenarios with large bag sizes, which hurts classifier induction. To alleviate this problem, we suggest a novel LLP method, namely Learning from Label Proportions with Auxiliary High-confident Instance-level Loss (L^2P-AHIL). Specifically, we propose a dual entropy-based weight (DEW) method to adaptively measure the confidence of pseudo-labels. It simultaneously emphasizes accurate predictions at the bag level and avoids overly smoothed predictions. We then form a high-confident instance-level loss with DEW and jointly optimize it with the bag-level loss in a self-training manner. Experimental results on benchmark datasets show that L^2P-AHIL surpasses the existing baseline methods, and that the performance gain becomes more significant as the bag size increases. The implementation of our method is available at https://github.com/TianhaoMa5/LLP-AHIL.
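To make the two loss terms concrete, the following is a minimal NumPy sketch for a single bag. The bag-level term is the usual cross-entropy between the given class proportions and the mean predicted distribution. The abstract does not give the DEW formula, so `illustrative_weights` is a hypothetical stand-in (bag-level agreement multiplied by one minus the normalized instance entropy), not the authors' definition; all function names here are assumptions made for illustration.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def bag_level_loss(probs, proportions, eps=1e-12):
    """Cross-entropy between the bag's label proportions and the mean prediction."""
    mean_pred = probs.mean(axis=0)
    return float(-(proportions * np.log(mean_pred + eps)).sum())


def illustrative_weights(probs, proportions, eps=1e-12):
    """Hypothetical confidence weights (NOT the paper's DEW formula).

    Assumption: a pseudo-label is trusted more when the bag-level prediction
    matches the proportions and the instance prediction has low entropy.
    """
    num_classes = probs.shape[1]
    inst_entropy = -(probs * np.log(probs + eps)).sum(axis=1)
    inst_conf = 1.0 - inst_entropy / np.log(num_classes)
    bag_conf = np.exp(-bag_level_loss(probs, proportions, eps))
    return np.clip(bag_conf * inst_conf, 0.0, 1.0)


def weighted_instance_loss(probs, weights, eps=1e-12):
    """Weighted negative log-likelihood of each instance's argmax pseudo-label."""
    pseudo = probs.argmax(axis=1)
    nll = -np.log(probs[np.arange(len(probs)), pseudo] + eps)
    return float((weights * nll).mean())


# One bag of four instances and two classes; the last prediction is uniform.
logits = np.array([[5.0, -5.0], [5.0, -5.0], [-5.0, 5.0], [0.0, 0.0]])
proportions = np.array([0.5, 0.5])

probs = softmax(logits)
weights = illustrative_weights(probs, proportions)
total = bag_level_loss(probs, proportions) + weighted_instance_loss(probs, weights)
```

In this toy bag the uniform (over-smoothed) prediction receives a weight near zero, so it contributes almost nothing to the instance-level term, while the sharp predictions keep most of their weight. In a self-training setup the weights and pseudo-labels would normally be treated as constants when backpropagating; that detail is also an assumption here.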
