Margin Based PU Learning

Tieliang Gong; Guangtao Wang; Jieping Ye; Zongben Xu; Ming Lin

doi:10.1609/aaai.v32i1.11698

Authors

Tieliang Gong Xi'an Jiaotong University
Guangtao Wang University of Michigan
Jieping Ye University of Michigan
Zongben Xu Xi'an Jiaotong University
Ming Lin University of Michigan

DOI:

https://doi.org/10.1609/aaai.v32i1.11698

Keywords:

PU Learning, Generalization error, Classification

Abstract

The PU learning problem concerns about learning from positive and unlabeled data. A popular heuristic is to iteratively enlarge training set based on some margin-based criterion. However, little theoretical analysis has been conducted to support the success of these heuristic methods. In this work, we show that not all margin-based heuristic rules are able to improve the learned classifiers iteratively. We find that a so-called large positive margin oracle is necessary to guarantee the success of PU learning. Under this oracle, a provable positive-margin based PU learning algorithm is proposed for linear regression and classification under the truncated Gaussian distributions. The proposed algorithm is able to reduce the recovering error geometrically proportional to the positive margin. Extensive experiments on real-world datasets verify our theory and the state-of-the-art performance of the proposed PU learning algorithm.

Margin Based PU Learning

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information