A Convex Formulation for Learning from Crowds

Authors

  • Hiroshi Kajino The University of Tokyo
  • Yuta Tsuboi IBM Research - Tokyo
  • Hisashi Kashima The University of Tokyo

DOI:

https://doi.org/10.1609/aaai.v26i1.8105

Keywords:

Crowdsourcing, Convex Formulation, Machine Learning

Abstract

Crowdsourcing services are now widely used to collect large amounts of labeled data for machine learning, since they provide an easy way to obtain labels quickly and at very low cost. The use of crowdsourcing has introduced a new challenge in machine learning: coping with the variable quality of crowd-generated data. Although there have been many recent attempts to address the quality problem of multiple workers, only a few existing methods consider the problem of learning classifiers directly from such noisy data. All of these methods model the true labels as latent variables, which results in non-convex optimization problems. In this paper, we propose a convex optimization formulation for learning from crowds that avoids estimating the true labels by introducing personal models of the individual crowd workers. We also devise an efficient iterative method for solving the convex optimization problem by exploiting conditional independence structures among the multiple classifiers. We evaluate the proposed method against three competing methods on synthetic data sets and a real crowdsourced data set, and demonstrate that it outperforms all three.
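
The abstract does not spell out the objective, so the following is only a minimal sketch of one way the idea could look, not the paper's implementation. It assumes (these are assumptions, not statements from the abstract) that each worker j has a logistic-regression model w_j tied to a shared base model w0 by a quadratic penalty, with hypothetical hyperparameters lam and eta. Under that assumption the objective is jointly convex, the worker models decouple once w0 is fixed, and w0 has a closed form once the worker models are fixed. The function names below are illustrative.

```python
import numpy as np

def objective(w0, W, X_list, y_list, lam, eta):
    """Assumed objective: per-worker logistic loss plus quadratic coupling to w0."""
    total = 0.5 * eta * (w0 @ w0)
    for w, X, y in zip(W, X_list, y_list):
        z = X @ w
        # Numerically stable logistic loss for labels y in {0, 1}.
        total += np.sum(np.logaddexp(0.0, z) - y * z)
        total += 0.5 * lam * np.sum((w - w0) ** 2)
    return total

def fit_personal_models(X_list, y_list, lam=1.0, eta=1.0,
                        outer_iters=20, inner_iters=50):
    """Alternate between worker models (w0 fixed) and base model (workers fixed).

    X_list[j]: (n_j, d) features labeled by worker j.
    y_list[j]: (n_j,) labels in {0, 1} given by worker j.
    """
    J, d = len(X_list), X_list[0].shape[1]
    w0 = np.zeros(d)
    W = np.zeros((J, d))
    for _ in range(outer_iters):
        # Step 1: with w0 fixed, each worker's problem is independent.
        for j in range(J):
            X, y = X_list[j], y_list[j]
            # Upper bound on the gradient's Lipschitz constant, so that
            # a 1/L step never increases the objective.
            L = 0.25 * np.linalg.norm(X, 2) ** 2 + lam
            w = W[j].copy()
            for _ in range(inner_iters):
                p = 1.0 / (1.0 + np.exp(-(X @ w)))
                grad = X.T @ (p - y) + lam * (w - w0)
                w -= grad / L
            W[j] = w
        # Step 2: with the worker models fixed, w0 is a shrunken average.
        w0 = lam * W.sum(axis=0) / (eta + J * lam)
    return w0, W
```

In this sketch the true labels never appear as variables: only the workers' own labels are fitted, and the shared w0 is what would be used as the final classifier. Plain gradient descent is used for the per-worker step purely for brevity; any convex solver would do.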

Published

2021-09-20

How to Cite

Kajino, H., Tsuboi, Y., & Kashima, H. (2021). A Convex Formulation for Learning from Crowds. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 73-79. https://doi.org/10.1609/aaai.v26i1.8105