UNO! UNified Offline Training Paradigm for Learning Path Recommendation

Authors

  • Linzhi Peng Beihang University
  • Wentao Zhu Beihang University
  • Ke Cheng Beihang University
  • Heng Chang Tsinghua University
  • Junchen Ye Beihang University
  • Bowen Du Beihang University Zhongguancun Laboratory
  • Weifeng Lv Beihang University

DOI:

https://doi.org/10.1609/aaai.v40i18.38591

Abstract

With the wide adoption of online education platforms, adaptive learning systems have become increasingly important. Learning Path Recommendation (LPR) aims to dynamically adjust learning content to optimize learning efficiency based on individual student needs. However, current LPR methods suffer from sparse reward for precise assessment and only focus on anonymous sessions that overlook more personalized and effective paths. To address these challenges, we propose UNO, UNified Offline Training Paradigm for Learning Path Recommendation. This approach introduces an offline training paradigm in RL-based LPR to provide dense process rewards by a personalized advantage based on a reward model, which can estimate the students' internal knowledge levels on the learning targets. Additionally, we propose UniLPR model, a personalized recommendation system that unifies modeling the implicit relationships between students' long-term accumulation and evolving requirements for questions, and refines through Group Relative Policy Optimization(GRPO). Finally, we design learning tasks that encompass historical reviewing, recent learning, and long-term exploratory learning to simulate the comprehensive and diverse learning needs of students. Our UNO achieves state-of-the-art performance across all tasks, demonstrating its effectiveness.

Published

2026-03-14

How to Cite

Peng, L., Zhu, W., Cheng, K., Chang, H., Ye, J., Du, B., & Lv, W. (2026). UNO! UNified Offline Training Paradigm for Learning Path Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15617–15625. https://doi.org/10.1609/aaai.v40i18.38591

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II