Ding, Zezhen, et al. “OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 1, Mar. 2026, pp. 228-36, doi:10.1609/aaai.v40i1.36983.