GradPU: Positive-Unlabeled Learning via Gradient Penalty and Positive Upweighting

Songmin Dai; Xiaoqiang Li; Yue Zhou; Xichen Ye; Tong Liu

doi:10.1609/aaai.v37i6.25889

Authors

Songmin Dai Shanghai University
Xiaoqiang Li Shanghai University
Yue Zhou Shanghai University
Xichen Ye Shanghai University
Tong Liu Shanghai University

DOI:

https://doi.org/10.1609/aaai.v37i6.25889

Keywords:

ML: Classification and Regression, CV: Object Detection & Categorization, ML: Semi-Supervised Learning, ML: Unsupervised & Self-Supervised Learning

Abstract

Positive-unlabeled learning is an essential problem in many real-world applications with only labeled positive and unlabeled data, especially when the negative samples are difficult to identify. Most existing positive-unlabeled learning methods will inevitably overfit the positive class to some extent due to the existence of unidentified positive samples. This paper first analyzes the overfitting problem and proposes to bound the generalization errors via Wasserstein distances. Based on that, we develop a simple yet effective positive-unlabeled learning method, GradPU, which consists of two key ingredients: A gradient-based regularizer that penalizes the gradient norms in the interpolated data region, which improves the generalization of positive class; An unnormalized upweighting mechanism that assigns larger weights to those positive samples that are hard, not-well-fitted and less frequently labeled. It enforces the training error of each positive sample to be small and increases the robustness to the labeling bias. We evaluate our proposed GradPU on three datasets: MNIST, FashionMNIST, and CIFAR10. The results demonstrate that GradPU achieves state-of-the-art performance on both unbiased and biased positive labeling scenarios.

GradPU: Positive-Unlabeled Learning via Gradient Penalty and Positive Upweighting

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription