Zero-shot Recommendation: Towards Class Semantic Relation Learning for Inferring Labels of Unseen Micro-videos

Junyang Chen; Huan Wang; Yirui Wu; Qiuzhen Lin; Yunfeng Diao; Junkai Ji

doi:10.1609/aaai.v40i24.39103

Authors

Junyang Chen Shenzhen University
Huan Wang Huazhong Agricultural University
Yirui Wu Hohai University
Qiuzhen Lin Shenzhen University
Yunfeng Diao Hefei University of Technology
Junkai Ji Shenzhen University

DOI:

https://doi.org/10.1609/aaai.v40i24.39103

Abstract

Micro-video label prediction plays a pivotal role on contemporary video-sharing platforms, such as Kwai and Tiktok. The emergence of video content lacking labels presents a formidable challenge for conventional user interest prediction methods. This paper addresses the challenge of micro-video label prediction, particularly for unseen videos, by proposing a zero-shot method called Class Semantic Relation Learning (CSRL). Unlike traditional user interest prediction models, CSRL leverages the pre-trained Large Language Model (LLM) to enhance prediction accuracy for unlabeled videos. The novelty of CSRL lies in its integration of three key components: a raw feature autoencoder, LLM-enhanced features, and a decomposed graph network. The decomposed graph network is specifically designed to disentangle the relationships between labeled and unlabeled videos, offering a significant improvement over previous methods. By fusing hidden topics with LLM-enhanced text, CSRL effectively handles sparse video features. Experiments on large-scale datasets from the Kwai platform show that CSRL achieves state-of-the-art results, with up to 44.64% improvement in Hit Ratio (HR), highlighting its superiority over existing zero-shot recommendation models in predicting user interests within the user-video network.

Zero-shot Recommendation: Towards Class Semantic Relation Learning for Inferring Labels of Unseen Micro-videos

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information