Meta-learning for Positive-unlabeled Classification

Atsutoshi Kumagai; Tomoharu Iwata; Yasuhiro Fujiwara

Meta-learning for Positive-unlabeled Classification

Atsutoshi Kumagai, Tomoharu Iwata, Yasuhiro Fujiwara

TL;DR

This work tackles positive-unlabeled classification under data-scarce, cross-task settings by introducing a meta-learning framework that adapts to PU data on unseen tasks. The key idea is to estimate the Bayes optimal classifier through a task-specific density-ratio between PU and marginal densities and a positive class-prior, where task representations from permutation-invariant networks enable flexible, task-conditioned embeddings. The adaptation admits a closed-form solution for the density-ratio parameters, enabling efficient meta-training that minimizes the test classification risk across related tasks. Empirical results on synthetic and real datasets show the proposed method outperforming standard PU methods and PU-aware meta-learning baselines, with robust performance even when the target priors are unknown. The approach holds potential for rapid, data-efficient PU learning in applications like outlier detection, information retrieval, and personalized systems.

Abstract

We propose a meta-learning method for positive and unlabeled (PU) classification, which improves the performance of binary classifiers obtained from only PU data in unseen target tasks. PU learning is an important problem since PU data naturally arise in real-world applications such as outlier detection and information retrieval. Existing PU learning methods require many PU data, but sufficient data are often unavailable in practice. The proposed method minimizes the test classification risk after the model is adapted to PU data by using related tasks that consist of positive, negative, and unlabeled data. We formulate the adaptation as an estimation problem of the Bayes optimal classifier, which is an optimal classifier to minimize the classification risk. The proposed method embeds each instance into a task-specific space using neural networks. With the embedded PU data, the Bayes optimal classifier is estimated through density-ratio estimation of PU densities, whose solution is obtained as a closed-form solution. The closed-form solution enables us to efficiently and effectively minimize the test classification risk. We empirically show that the proposed method outperforms existing methods with one synthetic and three real-world datasets.

Meta-learning for Positive-unlabeled Classification

TL;DR

Abstract

Paper Structure (30 sections, 15 equations, 25 figures, 9 tables, 1 algorithm)

This paper contains 30 sections, 15 equations, 25 figures, 9 tables, 1 algorithm.

Introduction
Related Work
Preliminary
Proposed Method
Problem Formulation
Model
Task Representation Calculation
Density-ratio Estimation
Class-prior Estimation
Meta-training
Experiments
Data
Comparison Methods
Results
Conclusion
...and 15 more sections

Figures (25)

Figure 1: Our meta-learning procedure: (1) For each training iteration, we randomly sample PU data (support set) and positive and negative (PN) data (query set) from a randomly selected source task. (2) Task representation vectors, ${\bf z} ^{{\rm p}}$ and ${\bf z} ^{{\rm u}}$, are calculated from positive and unlabeled support data, respectively, by using the permutation-invariant neural networks (Section \ref{['taskexp']}). (3) With ${\bf z} ^{{\rm p}}$ and ${\bf z} ^{{\rm u}}$, all instances are embedded into a task-specific space by using a neural network. (4) By using the embedded support set, density-ratio estimation is performed and its closed-form solution ${\hat{r}} _{\ast}$ is obtained (Section \ref{['denexp']}). (5) By using ${\hat{r}} _{\ast}$ and the embedded support set, positive class-prior ${\hat{\pi}^{{\rm p}}} _{\ast}$ is estimated (Section \ref{['classexp']}). (6) By using ${\hat{r}} _{\ast}$ and ${\hat{\pi}^{{\rm p}}} _{\ast}$, the estimated Bayes optimal classifier $s_{\ast}$ is obtained (Section \ref{['classexp']}). (7) Test classification risk (loss) is calculated with the query set and the estimated Bayes optimal classifier, and it can be backpropagated to update all the neural networks (Section \ref{['trains']}).
Figure 2: Synthetic
Figure 3: Mnist-r
Figure 4: Isolet
Figure 5: IoT
...and 20 more figures

Meta-learning for Positive-unlabeled Classification

TL;DR

Abstract

Meta-learning for Positive-unlabeled Classification

Authors

TL;DR

Abstract

Table of Contents

Figures (25)