Beyond ReAct: A Planner-Centric Framework for Complex Tool-Augmented LLM Reasoning

Authors

  • Xiaolong Wei Beihang University
  • Yuehu Dong Baidu Inc.
  • Xingliang Wang Beijing University of Posts and Telecommunications
  • Xingyu Zhang Beijing Jiaotong University
  • Zhejun Zhao Baidu Inc.
  • Dongdong Shen Baidu Inc.
  • Long Xia Baidu Inc.
  • Dawei Yin Baidu Inc.

DOI:

https://doi.org/10.1609/aaai.v40i40.40676

Abstract

Existing tool-augmented large language models (LLMs) encounter significant challenges when processing complex queries. Current frameworks such as ReAct are prone to local optimization traps due to their reliance on incremental decision-making processes. To address these limitations, we propose a novel Planner-centric Plan-Execute paradigm that fundamentally resolves local optimization bottlenecks through architectural innovation. Central to our approach is a novel Planner model that performs global Directed Acyclic Graph (DAG) planning for complex queries, enabling optimized execution beyond conventional tool coordination. We also introduce ComplexTool-Plan, a large-scale benchmark dataset featuring complex queries that demand sophisticated multi-tool composition and coordination capabilities. Additionally, we develop a two-stage training methodology that integrates Supervised Fine-Tuning (SFT) with Group Relative Policy Optimization (GRPO), systematically enhancing the Planner's tool selection accuracy and global planning awareness through structured DAG-based planning. When integrated with a capable executor, our framework achieves state-of-the-art performance on the StableToolBench benchmark for complex user queries, demonstrating superior end-to-end execution capabilities and robust handling of intricate multi-tool workflows.

Downloads

Published

2026-03-14

How to Cite

Wei, X., Dong, Y., Wang, X., Zhang, X., Zhao, Z., Shen, D., … Yin, D. (2026). Beyond ReAct: A Planner-Centric Framework for Complex Tool-Augmented LLM Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(40), 33845–33853. https://doi.org/10.1609/aaai.v40i40.40676

Issue

Section

AAAI Technical Track on Natural Language Processing V