Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward

Authors

  • Mengyuan Yang Zhejiang University, China
  • Mengying Zhu Zhejiang University, China
  • Yan Wang School of Computing, Macqaurie University, Australia
  • Linxun Chen MYbank, Ant Group, China
  • Yilei Zhao Zhejiang University, China
  • Xiuyuan Wang Zhejiang University, China
  • Bing Han MYbank, Ant Group, China
  • Xiaolin Zheng Zhejiang University, China
  • Jianwei Yin Zhejiang University, China

DOI:

https://doi.org/10.1609/aaai.v38i8.28777

Keywords:

DMKM: Recommender Systems, NLP: (Large) Language Models

Abstract

Large language model-based explainable recommendation (LLM-based ER) systems can provide remarkable human-like explanations and have widely received attention from researchers. However, the original LLM-based ER systems face three low-quality problems in their generated explanations, i.e., lack of personalization, inconsistency, and questionable explanation data. To address these problems, we propose a novel LLM-based ER model denoted as LLM2ER to serve as a backbone and devise two innovative explainable quality reward models for fine-tuning such a backbone in a reinforcement learning paradigm, ultimately yielding a fine-tuned model denoted as LLM2ER-EQR, which can provide high-quality explanations. LLM2ER-EQR can generate personalized, informative, and consistent high-quality explanations learned from questionable-quality explanation datasets. Extensive experiments conducted on three real-world datasets demonstrate that our model can generate fluent, diverse, informative, and highly personalized explanations.

Published

2024-03-24

How to Cite

Yang, M., Zhu, M., Wang, Y., Chen, L., Zhao, Y., Wang, X., Han, B., Zheng, X., & Yin, J. (2024). Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward. Proceedings of the AAAI Conference on Artificial Intelligence, 38(8), 9250-9259. https://doi.org/10.1609/aaai.v38i8.28777

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management