Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation

Derong Xu; Xinhang Li; Ziheng Zhang; Zhenxi Lin; Zhihong Zhu; Zhi Zheng; Xian Wu; Xiangyu Zhao; Tong Xu; Enhong Chen

doi:10.1609/aaai.v39i24.34747

Authors

Derong Xu University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence City University of Hong Kong
Xinhang Li University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
Ziheng Zhang Jarvis Research Center, Tencent YouTu Lab
Zhenxi Lin Jarvis Research Center, Tencent YouTu Lab
Zhihong Zhu Peking University
Zhi Zheng University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
Xian Wu Jarvis Research Center, Tencent YouTu Lab
Xiangyu Zhao City University of Hong Kong
Tong Xu University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
Enhong Chen University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence

DOI:

https://doi.org/10.1609/aaai.v39i24.34747

Abstract

Large Language Models (LLMs) demonstrate remarkable capabilities, yet struggle with hallucination and outdated knowledge when tasked with complex knowledge reasoning, resulting in factually incorrect outputs. Previous studies have attempted to mitigate it by retrieving factual knowledge from large-scale knowledge graphs (KGs) to assist LLMs in logical reasoning and prediction of answers. However, this kind of approach often introduces noise and irrelevant data, especially in situations with extensive context from multiple knowledge aspects. In this way, LLM attention can be potentially mislead from question and relevant information. In our study, we introduce an Adaptive Multi-Aspect Retrieval-augmented over KGs (Amar) framework. This method retrieves knowledge including entities, relations, and subgraphs, and converts each piece of retrieved text into prompt embeddings. The Amar framework comprises two key sub-components: 1) a self-alignment module that aligns commonalities among entities, relations, and subgraphs to enhance retrieved text, thereby reducing noise interference; 2) a relevance gating module that employs a soft gate to learn the relevance score between question and multi-aspect retrieved data, to determine which information should be used to enhance LLMs' output, or even filtered altogether. Our method has achieved state-of-the-art performance on two common datasets, WebQSP and CWQ, showing a 1.9% improvement in accuracy over its best competitor and a 6.6% improvement in logical form generation over a method that directly uses retrieved text as context prompts. These results demonstrate the effectiveness of Amar in improving the reasoning of LLMs.

Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information