[1] Zhai, Y. et al. 2025. Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models. Proceedings of the AAAI Conference on Artificial Intelligence. 39, 25 (Apr. 2025), 27161–27169. DOI:https://doi.org/10.1609/aaai.v39i25.34924.