Better Peer Grading through Bayesian Inference
DOI:
https://doi.org/10.1609/aaai.v37i5.25757
Keywords:
HAI: Crowdsourcing, APP: Education, GTEP: Applications, GTEP: Mechanism Design
Abstract
Peer grading systems aggregate noisy reports from multiple students to approximate a "true" grade as closely as possible. Most current systems take either the mean or the median of the reported grades; others aim to estimate students' grading accuracy under a probabilistic model. This paper extends the state of the art in the latter approach in three key ways: (1) recognizing that students can behave strategically (e.g., reporting grades close to the class average without doing the work); (2) appropriately handling the censored data that arises from discrete-valued grading rubrics; and (3) using mixed integer programming to improve the interpretability of the grades assigned to students. We demonstrate how to make Bayesian inference practical in this model and evaluate our approach on both synthetic and real-world data obtained by using our implemented system in four large classes. These extensive experiments show that grade aggregation using our model accurately estimates true grades, students' likelihood of submitting uninformative grades, and the variation in their inherent grading error; we also characterize our model's robustness.
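To make the abstract's modeling ideas concrete, below is a minimal, self-contained Python sketch of the kind of generative process it describes: graders either report strategically (near the class average) or report the true grade plus noise, and all reports are censored onto a discrete rubric before being aggregated. This is an illustration only, not the authors' model, code, or parameter choices; every name and value here (e.g., p_strategic, grader_sd, the rubric spacing, the simple likelihood weighting) is an assumption made for the example.

```python
# Illustrative sketch (not the paper's model or implementation): a toy peer-grading
# generative process with strategic reports, per-grader noise, and rubric censoring,
# plus a crude model-based aggregate compared against mean/median baselines.
import numpy as np

rng = np.random.default_rng(0)

n_submissions, graders_per_submission = 200, 4
rubric = np.arange(0, 101, 5)      # assumed discrete rubric: 0, 5, ..., 100
class_average = 75.0
p_strategic = 0.15                 # assumed chance a grader skips the work
grader_sd = 6.0                    # assumed inherent grading error (std. dev.)

true_grades = rng.normal(class_average, 10.0, n_submissions)

reports = np.empty((n_submissions, graders_per_submission))
for i, g in enumerate(true_grades):
    strategic = rng.random(graders_per_submission) < p_strategic
    continuous = np.where(
        strategic,
        rng.normal(class_average, 2.0, graders_per_submission),  # uninformative report
        rng.normal(g, grader_sd, graders_per_submission),        # honest but noisy report
    )
    # Censoring: each continuous opinion is snapped to the nearest rubric level.
    reports[i] = rubric[np.abs(continuous[:, None] - rubric[None, :]).argmin(axis=1)]

# Baselines used by most current systems.
mean_est = reports.mean(axis=1)
median_est = np.median(reports, axis=1)

# A crude model-based aggregate: weight each report by the posterior probability
# that it was informative rather than strategic (ignoring censoring for brevity,
# and using the per-submission median as a rough stand-in for the true grade).
def normal_pdf(x, mu, sd):
    return np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))

lik_informative = normal_pdf(reports, median_est[:, None], grader_sd)
lik_strategic = normal_pdf(reports, class_average, 2.0)
w = (1.0 - p_strategic) * lik_informative
w = w / (w + p_strategic * lik_strategic)
model_est = (w * reports).sum(axis=1) / w.sum(axis=1)

def rmse(est):
    return np.sqrt(np.mean((est - true_grades) ** 2))

print(f"mean RMSE:   {rmse(mean_est):.2f}")
print(f"median RMSE: {rmse(median_est):.2f}")
print(f"model RMSE:  {rmse(model_est):.2f}")
```

Unlike this simplified weighting with fixed parameters, the paper performs Bayesian inference over the latent quantities jointly, estimating true grades, each student's likelihood of submitting uninformative grades, and their inherent grading error, while properly accounting for the censoring induced by the discrete rubric.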
Published
2023-06-26
How to Cite
Zarkoob, H., d’Eon, G., Podina, L., & Leyton-Brown, K. (2023). Better Peer Grading through Bayesian Inference. Proceedings of the AAAI Conference on Artificial Intelligence, 37(5), 6137-6144. https://doi.org/10.1609/aaai.v37i5.25757
Issue
Section
AAAI Technical Track on Humans and AI