When Top-ranked Recommendations Fail: Modeling Multi-Granular Negative Feedback for Explainable and Robust Video Recommendation

Authors

  • Siran Chen University of Chinese Academy of Sciences Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
  • Boyu Chen University of Chinese Academy of Sciences Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
  • Chenyun Yu Shenzhen Campus of Sun Yat-sen University
  • Yi Ouyang Platform and Content Group, Tencent
  • Lei Cheng Platform and Content Group, Tencent
  • Chengxiang Zhuo Platform and Content Group, Tencent
  • Zang Li Platform and Content Group, Tencent
  • Yali Wang Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences Shanghai Artificial Intelligence Laboratory

DOI:

https://doi.org/10.1609/aaai.v40i24.39114

Abstract

Existing video recommendation systems, relying mainly on ID-based embedding mapping and collaborative filtering, often fail to capture in-depth video content semantics. Moreover, most struggle to address biased user behaviors (e.g., accidental clicks, fast skips), leading to inaccurate interest modeling and frequent negative feedback in top recommendations with unclear causes. To tackle this issue, we collect real-world user video-watching sequences, annotate the reasons for users' dislikes, and construct a benchmark dataset for personalized explanations. We then introduce the Agentic Explainable Negative Feedback (ENF) framework, which integrates three core components: (1) the Profile Agent, extracting behavioral cues from users' historical data to derive psychological and personality profiles; (2) the Video Agent, performing comprehensive multimodal video analysis; and (3) the Reason Agent, synthesizing information from the other two agents to predict user engagement and generate explanations. Additionally, we propose the S-GRPO algorithm, enabling the model to progressively address complex tasks during reinforcement fine-tuning. Experimental results on the collected dataset show that our method significantly outperforms state-of-the-art baselines in negative feedback prediction and reason explanation. Notably, it achieves an 8.6% improvement over GPT-4o in reason classification. Deployment on the business platform further validates its benefits: increasing average user watch time by 6.2%, reducing the fast-skip rate by 9.4% , and significantly enhancing user satisfaction.

Downloads

Published

2026-03-14

How to Cite

Chen, S., Chen, B., Yu, C., Ouyang, Y., Cheng, L., Zhuo, C., … Wang, Y. (2026). When Top-ranked Recommendations Fail: Modeling Multi-Granular Negative Feedback for Explainable and Robust Video Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(24), 20262–20270. https://doi.org/10.1609/aaai.v40i24.39114

Issue

Section

AAAI Technical Track on Machine Learning I