K-ON: Stacking Knowledge on the Head Layer of Large Language Model

Lingbing Guo; Yichi Zhang; Zhongpu Bo; Zhuo Chen; Mengshu Sun; Zhiqiang Zhang; Wen Zhang; Huajun Chen

doi:10.1609/aaai.v39i11.33278

Authors

Lingbing Guo College of Computer Science and Technology, Zhejiang University ZJU-Ant Group Joint Lab of Knowledge Graph
Yichi Zhang College of Computer Science and Technology, Zhejiang University ZJU-Ant Group Joint Lab of Knowledge Graph
Zhongpu Bo Ant Group
Zhuo Chen College of Computer Science and Technology, Zhejiang University ZJU-Ant Group Joint Lab of Knowledge Graph
Mengshu Sun Ant Group
Zhiqiang Zhang Ant Group
Wen Zhang School of Software Technology, Zhejiang University ZJU-Ant Group Joint Lab of Knowledge Graph
Huajun Chen College of Computer Science and Technology, Zhejiang University ZJU-Ant Group Joint Lab of Knowledge Graph Zhejiang Key Laboratory of Big Data Intelligent Computing

DOI:

https://doi.org/10.1609/aaai.v39i11.33278

Abstract

Recent advancements in large language models (LLMs) have significantly improved various natural language processing (NLP) tasks. Typically, LLMs are trained to predict the next token, aligning well with many NLP tasks. However, in knowledge graph (KG) scenarios, entities are the fundamental units and identifying an entity requires at least several tokens. This leads to a granularity mismatch between KGs and natural languages. To address this issue, we propose K-ON, which integrates KG knowledge into the LLM by employing multiple head layers for next k-step prediction. K-ON can not only generate entity-level results in one step, but also enables contrastive loss against entities, which is the most powerful tool in KG representation learning. Experimental results show that K-ON outperforms state-of-the-art methods that incorporate text and even the other modalities.

K-ON: Stacking Knowledge on the Head Layer of Large Language Model

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information