WEI, W.; LIU, Y.-A.; ZHANG, R.; GUO, J.; SU, L.; WANG, S.; YIN, D.; RIJKE, M. de; CHENG, X. Thinking Forward and Backward: Multi-Objective Reinforcement Learning for Retrieval-Augmented Reasoning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 40, p. 33836-33844, 2026. DOI: 10.1609/aaai.v40i40.40675. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/40675. Accessed: 2 May 2026.