Yue, Chuhuai, Chengqi Dong, Yinan Gao, Hang He, Jiajun Chai, Wei Lin, and Guojun Yin. 2026. “Promoting Efficient Reasoning With Verifiable Stepwise Reward.” Proceedings of the AAAI Conference on Artificial Intelligence 40 (41): 34530–38. https://doi.org/10.1609/aaai.v40i41.40752.