ProgRAG: Hallucination-Resistant Progressive Retrieval and Reasoning over Knowledge Graphs

Authors

  • Minbae Park Hanyang University
  • Hyemin Yang Hanyang University
  • Jeonghyun Kim Hanyang University
  • Kunsoo Park Seoul National University
  • Hyunjoon Kim Hanyang University

DOI:

https://doi.org/10.1609/aaai.v40i39.40545

Abstract

Large Language Models (LLMs) demonstrate strong reasoning capabilities but still struggle with hallucinations and limited transparency. Recently, LLMs enhanced with knowledge graphs (KGs) have been shown to improve reasoning performance, particularly for complex, knowledge-intensive tasks. However, these methods still face significant challenges, including inaccurate retrieval and reasoning failures, often exacerbated by long input contexts that obscure relevant information. Furthermore, many of these approaches rely on LLMs to directly retrieve evidence from KGs and to self-assess the sufficiency of this evidence, which often results in premature or incorrect reasoning. To address these retrieval and reasoning failures, we propose ProgRAG, a multi-hop knowledge graph question answering (KGQA) framework that decomposes complex questions into sub-questions and progressively extends partial reasoning paths by answering each sub-question. At each step, external retrievers gather candidate evidence, which is then refined through uncertainty-aware pruning by the LLM. Finally, the context for LLM reasoning is optimized by organizing and rearranging the partial reasoning paths obtained from the sub-question answers. Experiments on two well-known datasets, WebQSP and CWQ, demonstrate that ProgRAG outperforms existing baselines in multi-hop KGQA, offering improved reliability and reasoning quality.
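
The progressive loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the tiny graph, the hand-written sub-questions, the word-overlap `score` function standing in for an LLM, and the entropy threshold used for pruning are all assumptions made only for illustration. The abstract does not specify how uncertainty is measured, so entropy over a softmax of scores is simply one plausible choice.

```python
import math

# Toy knowledge graph as (head, relation, tail) triples (illustrative data only).
KG = [
    ("Inception", "directed_by", "Christopher Nolan"),
    ("Inception", "starring", "Leonardo DiCaprio"),
    ("Christopher Nolan", "born_in", "London"),
    ("Leonardo DiCaprio", "born_in", "Los Angeles"),
]

def retrieve(entity):
    """Hypothetical external retriever: all triples leaving `entity`."""
    return [t for t in KG if t[0] == entity]

def score(sub_question, triple):
    """Hypothetical stand-in for an LLM relevance score: word overlap with the relation."""
    words = set(sub_question.lower().replace("?", "").split())
    return len(words & set(triple[1].split("_")))

def entropy(probs):
    """Shannon entropy (natural log) of a probability distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def prune(sub_question, candidates, threshold=0.5, sharpness=3.0):
    """Assumed pruning rule: keep only the top candidate when the score
    distribution is confident (low entropy); otherwise keep every candidate."""
    if not candidates:
        return []
    weights = [math.exp(sharpness * score(sub_question, c)) for c in candidates]
    total = sum(weights)
    probs = [w / total for w in weights]
    if entropy(probs) <= threshold:
        best = max(range(len(candidates)), key=lambda i: probs[i])
        return [candidates[best]]
    return candidates

def progressive_paths(topic_entity, sub_questions):
    """Extend partial reasoning paths by one hop per sub-question."""
    paths = [[]]  # each path is a list of triples
    for sub_question in sub_questions:
        extended = []
        for path in paths:
            current = path[-1][2] if path else topic_entity
            for triple in prune(sub_question, retrieve(current)):
                extended.append(path + [triple])
        paths = extended
    return paths

def format_context(paths):
    """Arrange the surviving paths as compact text for a final reasoning prompt."""
    return "\n".join(
        " -> ".join(f"({h}, {r}, {t})" for h, r, t in path) for path in paths
    )

# Sub-questions are written by hand here; in the paper they come from question decomposition.
paths = progressive_paths(
    "Inception", ["Who directed Inception?", "Where was this person born?"]
)
print(format_context(paths))
```

In this sketch the first sub-question confidently selects the `directed_by` edge, so the unrelated `starring` branch is dropped before the second hop and only one short path reaches the final context. When the scores are uninformative, the same rule keeps every candidate instead of committing early.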

Published

2026-03-14

How to Cite

Park, M., Yang, H., Kim, J., Park, K., & Kim, H. (2026). ProgRAG: Hallucination-Resistant Progressive Retrieval and Reasoning over Knowledge Graphs. Proceedings of the AAAI Conference on Artificial Intelligence, 40(39), 32674–32682. https://doi.org/10.1609/aaai.v40i39.40545

Issue

Section

AAAI Technical Track on Natural Language Processing IV