DING, Zezhen; TAN, Zhen; ZHANG, Jiheng; CHEN, Tianlong. OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 40, n. 1, p. 228–236, 2026. DOI: 10.1609/aaai.v40i1.36983. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/36983. Acesso em: 25 may. 2026.