VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning
DOI:
https://doi.org/10.1609/aaai.v40i3.37152Abstract
Large language model (LLM) agents have emerged as a promising solution for enhancing recommendation systems via user simulation. However, existing studies predominantly resort to prompt-based simulation using frozen LLMs, which frequently results in suboptimal item modeling and user preference learning, thereby ultimately constraining recommendation performance. To address these challenges, we introduce VRAgent-R1, a novel agent-based paradigm that incorporates human-like intelligence in user simulation. Specifically, VRAgent-R1 comprises two distinct agents: the Item Perception (IP) Agent and the User Simulation (US) Agent, designed for interactive user-item modeling. Firstly, the IP Agent emulates human-like progressive thinking based on MLLMs, effectively capturing hidden recommendation semantics in videos. With a more comprehensive multimodal content understanding provided by the IP Agent, the video recommendation system is equipped to provide higher-quality candidate items. Subsequently, the US Agent refines the recommended video sets based on in-depth chain-of-thought (CoT) reasoning and achieves better alignment with real user preferences through reinforcement learning. Experimental results on a large-scale video recommendation benchmark MicroLens-100k have demonstrated the effectiveness of our proposed VRAgent-R1 method, e.g., the IP Agent achieves a 6.0% improvement in NDCG@10, while the US Agent shows approximately 45.0% higher accuracy in user decision simulation compared to state-of-the-art baselines.Published
2026-03-14
How to Cite
Chen, S., Chen, B., Luo, Y., Yu, C., Ouyang, Y., Cheng, L., … Wang, Y. (2026). VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(3), 1739–1747. https://doi.org/10.1609/aaai.v40i3.37152
Issue
Section
AAAI Technical Track on Cognitive Modeling & Cognitive Systems