RefRea: Reference-Guided Reasoning with Meta-Cognition for Accurate Language Model Agents
DOI:
https://doi.org/10.1609/aaai.v40i38.40522Abstract
In recent years, with the rapid development of large language models (LLMs), LLM-based agents have achieved remarkable progress across a wide range of tasks. However, reasoning inconsistencies in LLMs still significantly limit the performance of agents in complex decision-making scenarios. Cognitive science research suggests that individuals can benefit from observing others' explicit thinking processes to improve their strategy-making. Inspired by this mechanism, we propose Reference-guided Reasoning with meta-cognition (RefRea), a novel approach that enhances decision-making by introducing a reference language model to guide and calibrate the reasoning model's actions. RefRea enhances reasoning accuracy and stability by integrating a reference model and a meta-cognition module. The reference model relies solely on validated meta-cognition for consistent guidance, while the reasoning model interacts with the environment using both validated and exploratory meta-cognition. Guidance is provided by comparing the action similarity between the reference and reasoning models. This process is supported by the meta-cognition module, which generates summary knowledge by reflecting on action history and environmental feedback, leading to more adaptive and reliable behavior. We evaluate our algorithm in the text-based reasoning environment ScienceWorld. Experimental results demonstrate that RefRea outperforms state-of-the-art methods. Comprehensive ablation studies further highlight the effectiveness of both the reference model and the meta-cognition module.Downloads
Published
2026-03-14
How to Cite
Mai, Y., Yin, Q., Ni, W., Guo, J., Ouyang, X., Xu, P., & Huang, K. (2026). RefRea: Reference-Guided Reasoning with Meta-Cognition for Accurate Language Model Agents. Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), 32465–32473. https://doi.org/10.1609/aaai.v40i38.40522
Issue
Section
AAAI Technical Track on Natural Language Processing III