Crowdsourcing Multi-Label Classification for Taxonomy Creation

Authors

  • Jonathan Bragg University of Washington
  • - Mausam University of Washington
  • Daniel Weld University of Washington

DOI:

https://doi.org/10.1609/hcomp.v1i1.13091

Abstract

Recent work has introduced CASCADE, an algorithm for creating a globally-consistent taxonomy by crowdsourcing microwork from many individuals, each of whom may see only a tiny fraction of the data (Chilton et al. 2013). While CASCADE needs only unskilled labor and produces taxonomies whose quality approaches that of human experts, it uses significantly more labor than experts. This paper presents DELUGE, an improved workflow that produces taxonomies with comparable quality using significantly less crowd labor. Specifically, our method for crowdsourcing multi-label classification optimizes CASCADE’s most costly step (categorization) using less than 10% of the labor required by the original approach. DELUGE’s savings come from the use of decision theory and machine learning, which allow it to pose microtasks that aim to maximize information gain.

Downloads

Published

2013-11-03

How to Cite

Bragg, J., Mausam, .-., & Weld, D. (2013). Crowdsourcing Multi-Label Classification for Taxonomy Creation. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 1(1), 25-33. https://doi.org/10.1609/hcomp.v1i1.13091