Zhai, Yuanzhao, Tingkai Yang, Kele Xu, Dawei Feng, Cheng Yang, Bo Ding, and Huaimin Wang. 2025. “Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (25):27161-69. https://doi.org/10.1609/aaai.v39i25.34924.