LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation

Authors

  • Qidong Liu School of Auto. Science & Engineering, MOEKLINNS Lab, Xi'an Jiaotong University City University of Hong Kong
  • Xian Wu Jarvis Research Center, Tencent YouTu Lab
  • Wanyu Wang City University of Hong Kong
  • Yejing Wang City University of Hong Kong
  • Yuanshao Zhu City University of Hong Kong
  • Xiangyu Zhao City University of Hong Kong
  • Feng Tian School of Comp. Science & Technology, MOEKLINNS Lab, Xi’an Jiaotong University
  • Yefeng Zheng Jarvis Research Center, Tencent YouTu Lab Medical Artificial Intelligence Lab, Westlake University

DOI:

https://doi.org/10.1609/aaai.v39i11.33327

Abstract

Sequential Recommender Systems (SRS), which model a user's interaction history to predict the next item of interest, are widely used in various applications. However, existing SRS often struggle with low-popularity items, a challenge known as the long-tail problem. This issue leads to reduced serendipity for users and diminished profits for sellers, ultimately harming the overall system. Large Language Model (LLM) has the ability to capture semantic relationships between items, independent of their popularity, making them a promising solution to this problem. In this paper, we introduce LLMEmb, a novel method leveraging LLM to generate item embeddings that enhance SRS performance. To bridge the gap between general-purpose LLM and the recommendation domain, we propose a Supervised Contrastive Fine-Tuning (SCFT) approach. This approach includes attribute-level data augmentation and a tailored contrastive loss to make LLM more recommendation-friendly. Additionally, we emphasize the importance of integrating collaborative signals into LLM-generated embeddings, for which we propose Recommendation Adaptation Training (RAT). This further refines the embeddings for optimal use in SRS. The LLMEmb-derived embeddings can be seamlessly integrated with any SRS model, underscoring the practical value. Comprehensive experiments conducted on three real-world datasets demonstrate that LLMEmb significantly outperforms existing methods across multiple SRS models.

Published

2025-04-11

How to Cite

Liu, Q., Wu, X., Wang, W., Wang, Y., Zhu, Y., Zhao, X., … Zheng, Y. (2025). LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(11), 12183–12191. https://doi.org/10.1609/aaai.v39i11.33327

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management I