[1]
Y. Zhai, “Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models”, AAAI, vol. 39, no. 25, pp. 27161–27169, Apr. 2025.