Ding, Zezhen, Zhen Tan, Jiheng Zhang, and Tianlong Chen. 2026. “OR-R1: Automating Modeling and Solving of Operations Research Optimization Problem via Test-Time Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 40 (1):228-36. https://doi.org/10.1609/aaai.v40i1.36983.