Crowdsourcing Feature Discovery via Adaptively Chosen Comparisons

Authors

  • James Zou Microsoft Research
  • Kamalika Chaudhuri University of California, San Diego
  • Adam Kalai Microsoft Research

DOI:

https://doi.org/10.1609/hcomp.v3i1.13231

Keywords:

feature learning, crowd-sourcing

Abstract

We introduce an unsupervised approach to efficiently discover the underlying features in a data set via crowdsourcing. Our queries ask crowd members to articulate a feature common to two out of three displayed examples. In addition, we ask the crowd to provide binary labels for these discovered features on the remaining examples. The triples are chosen adaptively based on the labels of the previously discovered features on the data set. This approach is motivated by a formal framework of feature elicitation that we introduce and analyze in this paper. In two natural models of features, hierarchical and independent, we show that a simple adaptive algorithm recovers all features with less labor than any nonadaptive algorithm. The savings are as a result of automatically avoiding the elicitation of redundant features or synonyms. Experimental results validate the theoretical findings and the usefulness of this approach.

Downloads

Published

2015-09-23

How to Cite

Zou, J., Chaudhuri, K., & Kalai, A. (2015). Crowdsourcing Feature Discovery via Adaptively Chosen Comparisons. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 3(1), 198-205. https://doi.org/10.1609/hcomp.v3i1.13231