Sun, Z., Li, D., Hu, B., & Zhang, M. (2026). Improving Value-based Process Verifier via Low-Cost Variance Reduction. Proceedings of the AAAI Conference on Artificial Intelligence, 40(39), 33162–33170. https://doi.org/10.1609/aaai.v40i39.40600