Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning

Hengwei Zhao; Zhengzhong Tu; Zhuo Zheng; Wei Wang; Junjue Wang; Rusty Feagin; Wenzhe Jiao

Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning

Hengwei Zhao, Zhengzhong Tu, Zhuo Zheng, Wei Wang, Junjue Wang, Rusty Feagin, Wenzhe Jiao

TL;DR

The paper targets the core bottleneck in positive-unlabeled learning: learning discriminative representations under unreliable supervision. It introduces NcPU, a non-contrastive PU framework that combines NoiSNCL, a noisy-pair robust intra-class alignment loss, with PLD, a phantom label disambiguation scheme based on class prototypes and regret-based updates. The authors provide EM-inspired theoretical justification showing how NoiSNCL and PLD mutually reinforce each other, and they demonstrate substantial empirical gains across standard benchmarks and challenging remote-sensing datasets without requiring auxiliary negatives or priors. The results reveal that NcPU closes much of the gap to supervised performance and offers robust, scalable performance for real-world weakly supervised tasks, with broad applicability in areas such as post-disaster building damage mapping. The work suggests a promising direction for non-contrastive, prototype-informed PU learning and broader weak supervision contexts, with code to be released after review.

Abstract

Positive-Unlabeled (PU) learning aims to train a binary classifier (positive vs. negative) where only limited positive data and abundant unlabeled data are available. While widely applicable, state-of-the-art PU learning methods substantially underperform their supervised counterparts on complex datasets, especially without auxiliary negatives or pre-estimated parameters (e.g., a 14.26% gap on CIFAR-100 dataset). We identify the primary bottleneck as the challenge of learning discriminative representations under unreliable supervision. To tackle this challenge, we propose NcPU, a non-contrastive PU learning framework that requires no auxiliary information. NcPU combines a noisy-pair robust supervised non-contrastive loss (NoiSNCL), which aligns intra-class representations despite unreliable supervision, with a phantom label disambiguation (PLD) scheme that supplies conservative negative supervision via regret-based label updates. Theoretically, NoiSNCL and PLD can iteratively benefit each other from the perspective of the Expectation-Maximization framework. Empirically, extensive experiments demonstrate that: (1) NoiSNCL enables simple PU methods to achieve competitive performance; and (2) NcPU achieves substantial improvements over state-of-the-art PU methods across diverse datasets, including challenging datasets on post-disaster building damage mapping, highlighting its promise for real-world applications. Code: Code will be open-sourced after review.

Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning

TL;DR

Abstract

Noisy-Pair Robust Representation Alignment for Positive-Unlabeled Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (1)