LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation

Yatong Sun; Xiaochun Yang; Zhu Sun; Yan Wang; Bin Wang; Xinghua Qu

doi:10.1609/aaai.v39i12.33374

Authors

Yatong Sun School of Computer Science and Engineering, Northeastern University, China School of Computing, Macquarie University, Australia
Xiaochun Yang School of Computer Science and Engineering, Northeastern University, China
Zhu Sun Singapore University of Technology and Design, Singapore
Yan Wang School of Computing, Macquarie University, Australia
Bin Wang School of Computer Science and Engineering, Northeastern University, China National Frontiers Science Center for Industrial Intelligence and Systems Optimization, China Key Laboratory of Data Analytics and Optimization for Smart Industry (Northeastern University), Ministry of Education, China
Xinghua Qu Bytedance(Seed), Singapore

DOI:

https://doi.org/10.1609/aaai.v39i12.33374

Abstract

Sequential Recommenders (SRs) are trained to predict the next item as the target given its preceding items as the input, assuming every input-target pair is matched and is reliable for training. However, users can be induced by external distractions to click on items inconsistent with their true preferences, resulting in unreliable training instances with mismatched input-target pairs. To resist unreliable data, researchers attempt to develop Robust SRs (RSRs). However, our data analysis unveils that existing RSRs are data-driven. That is, for most instances formed by infrequently co-occurred items, existing RSRs are uncertain about their reliability. To fill this gap, we propose a generic framework -- LLM4RSR (Large Language Models for Robust Sequential Recommendation) to semantically complement data-driven RSRs by correcting uncertain instances into reliable ones based on LLMs' semantic comprehension of items beyond co-occurrence. In this way, RSRs can be re-trained with the corrected data for better accuracy. This is a selective knowledge distillation procedure, where the LLM acts as a teacher guiding student RSRs via uncertain instances. To align LLMs with the data correction task and mitigate inherent hallucinations, we equip the LLM with profile, plan, and memory modules, which are automatically optimized via textual gradient descent, eliminating the need for human effort and expertise. Experiments on four real-world datasets spanning eight backbones verify the generality, effectiveness, and efficiency of LLM4RSR.

LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information