Imbalanced Multiple Noisy Labeling for Supervised Learning

Authors

  • Jing Zhang, Hefei University of Technology
  • Xindong Wu, University of Vermont
  • Victor Sheng, University of Central Arkansas

DOI:

https://doi.org/10.1609/aaai.v27i1.8530

Keywords:

Crowdsourcing, Multiple Noisy Labeling, Supervised Learning

Abstract

When objects are labeled through Internet-based outsourcing systems, the labelers may be biased because of a lack of expertise, a lack of dedication, or personal preference. This bias causes Imbalanced Multiple Noisy Labeling. To deal with the imbalanced labeling issue, we propose an agnostic algorithm, PLAT (Positive LAbel frequency Threshold), which needs no information about the quality of the labelers or the underlying class distribution. Simulations on eight real-world datasets with different underlying class distributions demonstrate that PLAT not only effectively handles the imbalanced multiple noisy labeling problem, which off-the-shelf agnostic methods cannot cope with, but also performs nearly the same as majority voting when labelers have no bias.
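
The abstract does not spell out how PLAT chooses its threshold, so the following is only an illustrative sketch of the general idea of thresholding the positive label frequency, not the authors' algorithm. It shows that majority voting corresponds to a fixed threshold of 0.5, and that a lower threshold can recover positives when labelers are biased toward the negative class. The function names, the toy label sets, and the threshold value 0.3 are all assumptions made for illustration.

```python
from typing import List


def positive_frequency(labels: List[int]) -> float:
    """Fraction of positive (1) labels in one example's multiple noisy label set."""
    return sum(labels) / len(labels)


def integrate(label_sets: List[List[int]], threshold: float = 0.5) -> List[int]:
    """Assign class 1 when the positive label frequency exceeds the threshold.

    With threshold=0.5 and an odd number of labels per example this is plain
    majority voting. How PLAT picks its threshold is NOT reproduced here;
    the caller supplies a hypothetical value.
    """
    return [1 if positive_frequency(ls) > threshold else 0 for ls in label_sets]


# Toy data (hypothetical): five noisy binary labels per example, from labelers
# who tend to under-report the positive class.
label_sets = [
    [1, 0, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 0, 0],
]

majority = integrate(label_sets, threshold=0.5)
lowered = integrate(label_sets, threshold=0.3)  # assumed threshold, for illustration

print(majority)
print(lowered)
```

In this toy setting the second example is labeled negative by majority voting but positive under the lowered threshold, which is the kind of correction a frequency threshold can make when the labeling is imbalanced.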

Published

2013-06-29

How to Cite

Zhang, J., Wu, X., & Sheng, V. (2013). Imbalanced Multiple Noisy Labeling for Supervised Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 27(1), 1651-1652. https://doi.org/10.1609/aaai.v27i1.8530