G-UBS: Towards Robust Understanding of Implicit Feedback via Group-Aware User Behavior Simulation

Authors

  • Boyu Chen Shenzhen Key Laboratory of Computer Vision and Pattern Recognition, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, University of Chinese Academy of Science, Beijing, China, Platform and Content Group, Tencent, Shenzhen, China
  • Siran Chen Shenzhen Key Laboratory of Computer Vision and Pattern Recognition, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, University of Chinese Academy of Science, Beijing, China, Platform and Content Group, Tencent, Shenzhen, China
  • Zhengrong Yue Shanghai Jiaotong University
  • Kainan Yan Shenzhen Key Laboratory of Computer Vision and Pattern Recognition, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, University of Chinese Academy of Science, Beijing, China,
  • Chenyun Yu Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
  • Beibei Kong Platform and Content Group, Tencent, Shenzhen, China
  • Lei Cheng Platform and Content Group, Tencent, Shenzhen, China Cheng is the last name. Lei is the given name.
  • Chengxiang Zhuo Platform and Content Group, Tencent, Shenzhen, China
  • Zang Li Platform and Content Group, Tencent, Shenzhen, China
  • Yali Wang Shenzhen Key Laboratory of Computer Vision and Pattern Recognition, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, Shanghai Artificial Intelligence Laboratory, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v40i35.40174

Abstract

User feedback is critical for refining recommendation systems, yet explicit feedback (e.g., likes or dislikes) remains scarce in practice. As a more feasible alternative, inferring user preferences from massive implicit feedback has shown great potential (e.g., a user quickly skipping a recommended video usually indicates disinterest). Unfortunately, implicit feedback is often noisy: a user might skip a video due to accidental clicks or other reasons, rather than disliking it. Such noise can easily misjudge user interests, thereby undermining recommendation performance. To address this issue, we propose a novel Group-aware User Behavior Simulation (G-UBS) paradigm, which leverages contextual guidance from relevant user groups, enabling robust and in-depth interpretation of implicit feedback for individual users. Specifically, G-UBS operates via two key agents. First, the User Group Manager (UGM) effectively clusters users to generate group profiles utilizing a ``summarize-cluster-reflect" workflow based on LLMs. Second, the User Feedback Modeler (UFM) employs an innovative group-aware reinforcement learning approach, where each user is guided by the associated group profiles during the reinforcement learning process, allowing UFM to robustly and deeply examine the reasons behind implicit feedback. To assess our G-UBS paradigm, we have constructed a Video Recommendation benchmark with Implicit Feedback (IF-VR). To the best of our knowledge, this is the first multi-modal benchmark for implicit feedback evaluation in video recommendation, encompassing 15k users, 25k videos, and 933k interaction records with implicit feedback. Extensive experiments on IF-VR demonstrate that G-UBS significantly outperforms mainstream LLMs and MLLMs, with a 4.0% higher proportion of videos achieving a play rate > 30% and 14.9% higher reasoning accuracy on IF-VR.

Downloads

Published

2026-03-14

How to Cite

Chen, B., Chen, S., Yue, Z., Yan, K., Yu, C., Kong, B., Cheng, L., Zhuo, C., Li, Z., & Wang, Y. (2026). G-UBS: Towards Robust Understanding of Implicit Feedback via Group-Aware User Behavior Simulation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(35), 29341-29349. https://doi.org/10.1609/aaai.v40i35.40174

Issue

Section

AAAI Technical Track on Multiagent Systems