Calibratable Disambiguation Loss for Multi-Instance Partial-Label Learning

Wei Tang; Yin-Fang Yang; Weijia Zhang; Min-Ling Zhang

TL;DR

This work addresses calibration under dual-inexact supervision by introducing the Calibratable Disambiguation Loss (CDL) for multi-instance partial-label learning (MIPL). CDL has two instantiations and is designed as a plug-and-play loss that improves both classification accuracy and probability calibration; it integrates with existing attention-based MIPL frameworks. The authors derive a theoretical lower bound showing that CDL regularizes training through a data-driven confidence margin. Experiments on benchmark and real-world datasets report state-of-the-art accuracy and substantially lower expected calibration error (ECE). The authors further analyze the mechanisms behind CDL’s success, including improved feature aggregation and more reliable label disambiguation, and offer practical guidance on selecting CDL variants and attention mechanisms according to data complexity.
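For readers unfamiliar with the calibration metric mentioned above, ECE is commonly computed by grouping predictions into confidence bins and averaging the gap between accuracy and mean confidence in each bin, weighted by bin size. The following is a minimal sketch of that standard binned estimate, assuming equal-width bins and top-label confidences; the function name and the bin count are illustrative choices, and the paper's exact evaluation protocol is not given in this summary.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Binned ECE: sum over bins of (n_b / n) * |accuracy_b - mean_confidence_b|.

    confidences: predicted probability of the chosen label, each in [0, 1].
    correct:     1 (or True) if the prediction was right, else 0 (or False).
    n_bins:      number of equal-width bins (an assumed default, not from the paper).
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        raise ValueError("at least one prediction is required")

    conf_sum = [0.0] * n_bins  # total confidence per bin
    hit_sum = [0.0] * n_bins   # number of correct predictions per bin

    for conf, ok in zip(confidences, correct):
        # Bins are [k/n_bins, (k+1)/n_bins); a confidence of 1.0 goes to the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        conf_sum[idx] += conf
        hit_sum[idx] += 1.0 if ok else 0.0

    # (n_b / n) * |acc_b - conf_b| simplifies to |hits_b - conf_sum_b| / n.
    return sum(abs(h - c) for h, c in zip(hit_sum, conf_sum)) / n


if __name__ == "__main__":
    # Two confident correct predictions, two less confident ones (one wrong).
    ece = expected_calibration_error([0.95, 0.95, 0.55, 0.55], [1, 1, 1, 0])
    print(f"ECE = {ece:.3f}")
```

A lower value means the predicted confidences track the observed accuracy more closely; a model whose 75%-confidence predictions are right 75% of the time contributes zero to the sum.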

Abstract

Multi-instance partial-label learning (MIPL) is a weakly supervised framework that extends the principles of multi-instance learning (MIL) and partial-label learning (PLL) to address the challenges of inexact supervision in both instance and label spaces. However, existing MIPL approaches often suffer from poor calibration, undermining classifier reliability. In this work, we propose a plug-and-play calibratable disambiguation loss (CDL) that simultaneously improves classification accuracy and calibration performance. The loss has two instantiations: the first one calibrates predictions based on probabilities from the candidate label set, while the second one integrates probabilities from both candidate and non-candidate label sets. The proposed CDL can be seamlessly incorporated into existing MIPL and PLL frameworks. We provide a theoretical analysis that establishes the lower bound and regularization properties of CDL, demonstrating its superiority over conventional disambiguation losses. Experimental results on benchmark and real-world datasets confirm that our CDL significantly enhances both classification and calibration performance.
