Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning

Authors

  • Yipeng Zhang Department of Computer Science and Technology, Tsinghua University
  • Xin Wang Department of Computer Science and Technology, Tsinghua University Beijing National Research Center for Information Science and Technology, Tsinghua University
  • Hong Chen Department of Computer Science and Technology, Tsinghua University
  • Junwei Pan Tencent Inc.
  • Qian Li Tencent Inc.
  • Jun Zhang Tencent Inc.
  • Jie Jiang Tencent Inc.
  • Hong Mei MoE Key Lab of High Confidence Software Technologies, Peking University
  • Wenwu Zhu Department of Computer Science and Technology, Tsinghua University Beijing National Research Center for Information Science and Technology, Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v40i19.38680

Abstract

Sequential recommendation aims to predict the next item based on historical interactions. To further enhance the reasoning capability in sequential recommendation, LLMs are employed to predict the next item or generate semantic IDs for item representation, given LLMs' extensive domain knowledge and reasoning ability. However, existing LLM-based methods suffer from two limitations. (i) The scarcity of recommendation data with reasoning paths makes it challenging to design suitable chain-of-thought prompting templates, and the full potential of LLMs' reasoning abilities remains underutilized. (ii) Upon obtaining semantic IDs, the LLMs and their representations are excluded from the subsequent recommendation model training, preventing downstream models from fully utilizing the rich semantic information encoded within these IDs. To address these issues, we propose a novel CoderRec framework, which is capable of fully exploiting the information encoded in semantic IDs to guide the recommendation process. Specifically, to address the problem of scarcity in reasoning path-augmented data, we introduce latent reasoning into sequential recommendation and treat the representation captured by the downstream model as domain-specific latent thought, enabling implicit logical inference without requiring explicit CoT annotations. To ensure that the downstream recommendation models are able to deeply leverage the semantic information within IDs, we propose a novel cross-scale model collaboration strategy, which employs cross-scale IDs and a two-phase approach to align LLM-derived semantics with recommendation objectives. Extensive experiments have shown the effectiveness of our proposed CoderRec framework.

Downloads

Published

2026-03-14

How to Cite

Zhang, Y., Wang, X., Chen, H., Pan, J., Li, Q., Zhang, J., … Zhu, W. (2026). Cross-Scale Collaboration between LLMs and Lightweight Sequential Recommenders with Domain-Specific Latent Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(19), 16415–16423. https://doi.org/10.1609/aaai.v40i19.38680

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management III