Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation

Authors

  • Derong Xu University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence City University of Hong Kong
  • Xinhang Li University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
  • Ziheng Zhang Jarvis Research Center, Tencent YouTu Lab
  • Zhenxi Lin Jarvis Research Center, Tencent YouTu Lab
  • Zhihong Zhu Peking University
  • Zhi Zheng University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
  • Xian Wu Jarvis Research Center, Tencent YouTu Lab
  • Xiangyu Zhao City University of Hong Kong
  • Tong Xu University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence
  • Enhong Chen University of Science and Technology of China & State Key Laboratory of Cognitive Intelligence

DOI:

https://doi.org/10.1609/aaai.v39i24.34747

Abstract

Large Language Models (LLMs) demonstrate remarkable capabilities, yet struggle with hallucination and outdated knowledge when tasked with complex knowledge reasoning, resulting in factually incorrect outputs. Previous studies have attempted to mitigate it by retrieving factual knowledge from large-scale knowledge graphs (KGs) to assist LLMs in logical reasoning and prediction of answers. However, this kind of approach often introduces noise and irrelevant data, especially in situations with extensive context from multiple knowledge aspects. In this way, LLM attention can be potentially mislead from question and relevant information. In our study, we introduce an Adaptive Multi-Aspect Retrieval-augmented over KGs (Amar) framework. This method retrieves knowledge including entities, relations, and subgraphs, and converts each piece of retrieved text into prompt embeddings. The Amar framework comprises two key sub-components: 1) a self-alignment module that aligns commonalities among entities, relations, and subgraphs to enhance retrieved text, thereby reducing noise interference; 2) a relevance gating module that employs a soft gate to learn the relevance score between question and multi-aspect retrieved data, to determine which information should be used to enhance LLMs' output, or even filtered altogether. Our method has achieved state-of-the-art performance on two common datasets, WebQSP and CWQ, showing a 1.9% improvement in accuracy over its best competitor and a 6.6% improvement in logical form generation over a method that directly uses retrieved text as context prompts. These results demonstrate the effectiveness of Amar in improving the reasoning of LLMs.

Downloads

Published

2025-04-11

How to Cite

Xu, D., Li, X., Zhang, Z., Lin, Z., Zhu, Z., Zheng, Z., … Chen, E. (2025). Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), 25570–25578. https://doi.org/10.1609/aaai.v39i24.34747

Issue

Section

AAAI Technical Track on Natural Language Processing III