HUANG, H.; HE, Y.; LIU, W.; YANG, M.; LIU, J.; CHEN, K.; XU, B.; ZHU, C.; CAO, H.; ZHAO, T. Long-form RewardBench: Evaluating Reward Models for Long-form Generation. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 37, p. 31149-31157, 2026. DOI: 10.1609/aaai.v40i37.40376. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/40376. Acesso em: 3 may. 2026.