(1)
Zhao, J.; Liu, R.; Zhang, K.; Zhou, Z.; Gao, J.; Li, D.; Lyu, J.; Qian, Z.; Qi, B.; Li, X. GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning. AAAI 2026, 40, 34932-34940.