(1)
Zhai, Y.; Yang, T.; Xu, K.; Feng, D.; Yang, C.; Ding, B.; Wang, H. Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models. AAAI 2025, 39, 27161-27169.