LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation

Qidong Liu; Xian Wu; Wanyu Wang; Yejing Wang; Yuanshao Zhu; Xiangyu Zhao; Feng Tian; Yefeng Zheng

doi:10.1609/aaai.v39i11.33327

Authors

Qidong Liu School of Auto. Science & Engineering, MOEKLINNS Lab, Xi'an Jiaotong University City University of Hong Kong
Xian Wu Jarvis Research Center, Tencent YouTu Lab
Wanyu Wang City University of Hong Kong
Yejing Wang City University of Hong Kong
Yuanshao Zhu City University of Hong Kong
Xiangyu Zhao City University of Hong Kong
Feng Tian School of Comp. Science & Technology, MOEKLINNS Lab, Xi’an Jiaotong University
Yefeng Zheng Jarvis Research Center, Tencent YouTu Lab Medical Artificial Intelligence Lab, Westlake University

DOI:

https://doi.org/10.1609/aaai.v39i11.33327

Abstract

Sequential Recommender Systems (SRS), which model a user's interaction history to predict the next item of interest, are widely used in various applications. However, existing SRS often struggle with low-popularity items, a challenge known as the long-tail problem. This issue leads to reduced serendipity for users and diminished profits for sellers, ultimately harming the overall system. Large Language Model (LLM) has the ability to capture semantic relationships between items, independent of their popularity, making them a promising solution to this problem. In this paper, we introduce LLMEmb, a novel method leveraging LLM to generate item embeddings that enhance SRS performance. To bridge the gap between general-purpose LLM and the recommendation domain, we propose a Supervised Contrastive Fine-Tuning (SCFT) approach. This approach includes attribute-level data augmentation and a tailored contrastive loss to make LLM more recommendation-friendly. Additionally, we emphasize the importance of integrating collaborative signals into LLM-generated embeddings, for which we propose Recommendation Adaptation Training (RAT). This further refines the embeddings for optimal use in SRS. The LLMEmb-derived embeddings can be seamlessly integrated with any SRS model, underscoring the practical value. Comprehensive experiments conducted on three real-world datasets demonstrate that LLMEmb significantly outperforms existing methods across multiple SRS models.

LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information