Context Selection and Rewriting for Video-based Educational Question Generation
DOI:
https://doi.org/10.1609/aaai.v40i48.42126Abstract
Educational question generation (EQG) is a crucial component of intelligent educational systems, significantly aiding self-assessment, active learning, and personalized education. While EQG systems have emerged, existing datasets typically rely on predefined, carefully edited texts, failing to represent real-world classroom content, including lecture speech with a set of complementary slides. To bridge this gap, we collect a dataset of educational questions based on videos from real-world lectures. On this realistic dataset, we find that current methods for EQG struggle to accurately generate questions from educational videos, particularly in aligning with specific timestamps and target answers. Common challenges include selecting informative contexts from extensive transcripts and ensuring generated questions meaningfully incorporate the target answer. To address the challenges, we introduce a novel framework utilizing large language models (LLMs) for dynamically selecting and rewriting contexts based on target timestamps and answers in lecture videos. First, our framework selects contexts from both lecture transcripts and video keyframes based on answer relevance and temporal proximity. Then, we integrate the contexts selected from both modalities and rewrite them into answer-containing knowledge statements, to enhance the logical connection between the contexts and the desired answer. Quantitative evaluation and human evaluation show that our approach improves the quality and relevance of the generated questions.Downloads
Published
2026-03-14
How to Cite
Yu, M., Nguyen, B., Zino, O., & Jiang, M. (2026). Context Selection and Rewriting for Video-based Educational Question Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 40925–40933. https://doi.org/10.1609/aaai.v40i48.42126
Issue
Section
EAAI Symposium: AI for Education