Combining Crowd and Expert Labels Using Decision Theoretic Active Learning

Authors

  • An Nguyen University of Texas at Austin
  • Byron Wallace University of Texas at Austin
  • Matthew Lease University of Texas at Austin

DOI:

https://doi.org/10.1609/hcomp.v3i1.13225

Abstract

We consider a finite-pool data categorization scenario which requires exhaustively classifying a given set of examples with a limited budget. We adopt a hybrid human-machine approach which blends automatic machine learning with human labeling across a tiered workforce composed of domain experts and crowd workers. To effectively achieve high-accuracy labels over the instances in the pool at minimal cost, we develop a novel approach based on decision-theoretic active learning. On the important task of biomedical citation screening for systematic reviews, results on real data show that our method achieves consistent improvements over baseline strategies. To foster further research by others, we have made our data available online.

Downloads

Published

2015-09-23

How to Cite

Nguyen, A., Wallace, B., & Lease, M. (2015). Combining Crowd and Expert Labels Using Decision Theoretic Active Learning. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 3(1), 120-129. https://doi.org/10.1609/hcomp.v3i1.13225