1.
Zhai Y, Yang T, Xu K, Feng D, Yang C, Ding B, et al. Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 31];39(25):27161-9. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/34924