Get a Head Start: On-Demand Pedagogical Policy Selection in Intelligent Tutoring

Authors

  • Ge Gao North Carolina State University
  • Xi Yang IBM Research
  • Min Chi North Carolina State University

DOI:

https://doi.org/10.1609/aaai.v38i11.29102

Keywords:

ML: Reinforcement Learning, HAI: Human-Computer Interaction, APP: Other Applications, ML: Applications

Abstract

Reinforcement learning (RL) is broadly employed in human-involved systems to enhance human outcomes. Off-policy evaluation (OPE) has been pivotal for RL in those realms since online policy learning and evaluation can be high-stake. Intelligent tutoring has raised tremendous attentions as highly challenging when applying OPE to human-involved systems, due to that students' subgroups can favor different pedagogical policies and the costly procedure that policies have to be induced fully offline and then directly deployed to the upcoming semester. In this work, we formulate on-demand pedagogical policy selection (ODPS) to tackle the challenges for OPE in intelligent tutoring. We propose a pipeline, EduPlanner, as a concrete solution for ODPS. Our pipeline results in an theoretically unbiased estimator, and enables efficient and customized policy selection by identifying subgroups over both historical data and on-arrival initial logs. We evaluate our approach on the Probability ITS that has been used in real classrooms for over eight years. Our study shows significant improvement on learning outcomes of students with EduPlanner, especially for the ones associated with low-performing subgroups.

Published

2024-03-24

How to Cite

Gao, G., Yang, X., & Chi, M. . (2024). Get a Head Start: On-Demand Pedagogical Policy Selection in Intelligent Tutoring. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12136-12144. https://doi.org/10.1609/aaai.v38i11.29102

Issue

Section

AAAI Technical Track on Machine Learning II