Efficient Online Crowdsourcing with Complex Annotations

Authors

  • Reshef Meir Technion--Israel Institute of Technology
  • Viet-An Nguyen Central Applied Science, Meta
  • Xu Chen Central Applied Science, Meta
  • Jagdish Ramakrishnan Central Applied Science, Meta
  • Udi Weinsberg Central Applied Science, Meta

DOI:

https://doi.org/10.1609/aaai.v38i9.28876

Keywords:

HAI: Crowd Sourcing and Human Computation

Abstract

Crowdsourcing platforms use various truth discovery algorithms to aggregate annotations from multiple labelers. In an online setting, however, the main challenge is to decide whether to ask for more annotations for each item to efficiently trade off cost (i.e., the number of annotations) for quality of the aggregated annotations. In this paper, we propose a novel approach for general complex annotation (such as bounding boxes and taxonomy paths), that works in an online crowdsourcing setting. We prove that the expected average similarity of a labeler is linear in their accuracy conditional on the reported label. This enables us to infer reported label accuracy in a broad range of scenarios. We conduct extensive evaluations on real-world crowdsourcing data from Meta and show the effectiveness of our proposed online algorithms in improving the cost-quality trade-off.

Published

2024-03-24

How to Cite

Meir, R., Nguyen, V.-A., Chen, X., Ramakrishnan, J., & Weinsberg, U. (2024). Efficient Online Crowdsourcing with Complex Annotations. Proceedings of the AAAI Conference on Artificial Intelligence, 38(9), 10119-10127. https://doi.org/10.1609/aaai.v38i9.28876

Issue

Section

AAAI Technical Track on Humans and AI