Get a Head Start: On-Demand Pedagogical Policy Selection in Intelligent Tutoring

Ge Gao; Xi Yang; Min Chi

doi:10.1609/aaai.v38i11.29102

Authors

Ge Gao North Carolina State University
Xi Yang IBM Research
Min Chi North Carolina State University

DOI:

https://doi.org/10.1609/aaai.v38i11.29102

Keywords:

ML: Reinforcement Learning, HAI: Human-Computer Interaction, APP: Other Applications, ML: Applications

Abstract

Reinforcement learning (RL) is broadly employed in human-involved systems to enhance human outcomes. Off-policy evaluation (OPE) has been pivotal for RL in those realms, since online policy learning and evaluation can be high-stakes. Intelligent tutoring has attracted considerable attention as a highly challenging setting for applying OPE to human-involved systems, because student subgroups can favor different pedagogical policies and because policies must be induced fully offline and then deployed directly in the upcoming semester, which is a costly procedure. In this work, we formulate on-demand pedagogical policy selection (ODPS) to tackle the challenges for OPE in intelligent tutoring. We propose a pipeline, EduPlanner, as a concrete solution for ODPS. Our pipeline results in a theoretically unbiased estimator and enables efficient, customized policy selection by identifying subgroups over both historical data and on-arrival initial logs. We evaluate our approach on the Probability ITS, which has been used in real classrooms for over eight years. Our study shows significant improvement in the learning outcomes of students with EduPlanner, especially for those associated with low-performing subgroups.
