Zhai, Y., Yang, T., Xu, K., Feng, D., Yang, C., Ding, B., & Wang, H. (2025). Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models. Proceedings of the AAAI Conference on Artificial Intelligence, 39(25), 27161–27169. https://doi.org/10.1609/aaai.v39i25.34924