Who Said What: Modeling Individual Labelers Improves Classification

Melody Guan; Varun Gulshan; Andrew Dai; Geoffrey Hinton

doi:10.1609/aaai.v32i1.11756

Authors

Melody Guan Stanford University
Varun Gulshan Google Brain
Andrew Dai Google Brain
Geoffrey Hinton Google Brain

DOI:

https://doi.org/10.1609/aaai.v32i1.11756

Keywords:

Deep Learning/Neural Networks, Classification

Abstract

Data are often labeled by many different experts with each expert only labeling a small fraction of the data and each data point being labeled by several experts. This reduces the workload on individual experts and also gives a better estimate of the unobserved ground truth. When experts disagree, the standard approaches are to treat the majority opinion as the correct label or to model the correct label as a distribution. These approaches, however, do not make any use of potentially valuable information about which expert produced which label. To make use of this extra information, we propose modeling the experts individually and then learning averaging weights for combining them, possibly in sample-specific ways. This allows us to give more weight to more reliable experts and take advantage of the unique strengths of individual experts at classifying certain types of data. Here we show that our approach leads to improvements in computer-aided diagnosis of diabetic retinopathy. We also show that our method performs better than competing algorithms by Welinder and Perona (2010); Mnih and Hinton (2012). Our work offers an innovative approach for dealing with the myriad real-world settings that use expert opinions to define labels for training.

Who Said What: Modeling Individual Labelers Improves Classification

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information