Zhao, Jian, Runze Liu, Kaiyan Zhang, Zhimu Zhou, Junqi Gao, Dong Li, Jiafei Lyu, et al. 2026. “GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (41):34932-40. https://doi.org/10.1609/aaai.v40i41.40797.