Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Authors

  • Zijian Wang College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China College of Science, China University of Petroleum(East China), China
  • Bin Wang College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China
  • Haifeng Jing College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China School of Software & Microelectronics, Peking University, China
  • Huayu Li College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China
  • Hongbo Dou College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China

DOI:

https://doi.org/10.1609/aaai.v39i12.33398

Abstract

Recent years, multi-hop reasoning has been widely studied for knowledge graph (KG) reasoning due to its efficacy and interpretability. However, previous multi-hop reasoning approaches are subject to two primary shortcomings. First, agents struggle to learn effective and robust policies at the early phase due to sparse rewards. Second, these approaches often falter on specific datasets like sparse knowledge graphs, where agents are required to traverse lengthy reasoning paths. To address these problems, we propose a multi-hop reasoning model with dual agents based on hierarchical reinforcement learning (HRL), which is named FULORA. FULORA tackles the above reasoning challenges by eFficient GUidance-ExpLORAtion between dual agents. The high-level agent walks on the simplified knowledge graph to provide stage-wise hints for the low-level agent walking on the original knowledge graph. In this framework, the low-level agent optimizes a value function that balances two objectives: (1) maximizing return, and (2) integrating efficient guidance from the high-level agent. Experiments conducted on three real-word knowledge graph datasets demonstrate that FULORA outperforms RL-based baselines, especially in the case of long-distance reasoning.

Published

2025-04-11

How to Cite

Wang, Z., Wang, B., Jing, H., Li, H., & Dou, H. (2025). Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration. Proceedings of the AAAI Conference on Artificial Intelligence, 39(12), 12818–12826. https://doi.org/10.1609/aaai.v39i12.33398

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II