Zhao, J., Liu, R., Zhang, K., Zhou, Z., Gao, J., Li, D., … Zhou, B. (2026). GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(41), 34932–34940. https://doi.org/10.1609/aaai.v40i41.40797