LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation

Authors

  • Yatong Sun School of Computer Science and Engineering, Northeastern University, China School of Computing, Macquarie University, Australia
  • Xiaochun Yang School of Computer Science and Engineering, Northeastern University, China
  • Zhu Sun Singapore University of Technology and Design, Singapore
  • Yan Wang School of Computing, Macquarie University, Australia
  • Bin Wang School of Computer Science and Engineering, Northeastern University, China National Frontiers Science Center for Industrial Intelligence and Systems Optimization, China Key Laboratory of Data Analytics and Optimization for Smart Industry (Northeastern University), Ministry of Education, China
  • Xinghua Qu Bytedance(Seed), Singapore

DOI:

https://doi.org/10.1609/aaai.v39i12.33374

Abstract

Sequential Recommenders (SRs) are trained to predict the next item as the target given its preceding items as the input, assuming every input-target pair is matched and is reliable for training. However, users can be induced by external distractions to click on items inconsistent with their true preferences, resulting in unreliable training instances with mismatched input-target pairs. To resist unreliable data, researchers attempt to develop Robust SRs (RSRs). However, our data analysis unveils that existing RSRs are data-driven. That is, for most instances formed by infrequently co-occurred items, existing RSRs are uncertain about their reliability. To fill this gap, we propose a generic framework -- LLM4RSR (Large Language Models for Robust Sequential Recommendation) to semantically complement data-driven RSRs by correcting uncertain instances into reliable ones based on LLMs' semantic comprehension of items beyond co-occurrence. In this way, RSRs can be re-trained with the corrected data for better accuracy. This is a selective knowledge distillation procedure, where the LLM acts as a teacher guiding student RSRs via uncertain instances. To align LLMs with the data correction task and mitigate inherent hallucinations, we equip the LLM with profile, plan, and memory modules, which are automatically optimized via textual gradient descent, eliminating the need for human effort and expertise. Experiments on four real-world datasets spanning eight backbones verify the generality, effectiveness, and efficiency of LLM4RSR.

Downloads

Published

2025-04-11

How to Cite

Sun, Y., Yang, X., Sun, Z., Wang, Y., Wang, B., & Qu, X. (2025). LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 12604–12612. https://doi.org/10.1609/aaai.v39i12.33374

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II