Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Zijian Wang; Bin Wang; Haifeng Jing; Huayu Li; Hongbo Dou

doi:10.1609/aaai.v39i12.33398

Authors

Zijian Wang College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China College of Science, China University of Petroleum(East China), China
Bin Wang College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China
Haifeng Jing College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China School of Software & Microelectronics, Peking University, China
Huayu Li College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China
Hongbo Dou College of Computer Science and Technology & Qingdao Institute of Software, China University of Petroleum(East China), China

DOI:

https://doi.org/10.1609/aaai.v39i12.33398

Abstract

Recent years, multi-hop reasoning has been widely studied for knowledge graph (KG) reasoning due to its efficacy and interpretability. However, previous multi-hop reasoning approaches are subject to two primary shortcomings. First, agents struggle to learn effective and robust policies at the early phase due to sparse rewards. Second, these approaches often falter on specific datasets like sparse knowledge graphs, where agents are required to traverse lengthy reasoning paths. To address these problems, we propose a multi-hop reasoning model with dual agents based on hierarchical reinforcement learning (HRL), which is named FULORA. FULORA tackles the above reasoning challenges by eFficient GUidance-ExpLORAtion between dual agents. The high-level agent walks on the simplified knowledge graph to provide stage-wise hints for the low-level agent walking on the original knowledge graph. In this framework, the low-level agent optimizes a value function that balances two objectives: (1) maximizing return, and (2) integrating efficient guidance from the high-level agent. Experiments conducted on three real-word knowledge graph datasets demonstrate that FULORA outperforms RL-based baselines, especially in the case of long-distance reasoning.

Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information