Zhi, W., Guo, J., & Li, S. (2026). MedGR2: Breaking the Data Barrier for Medical Reasoning via Generative Reward Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 28901–28909. https://doi.org/10.1609/aaai.v40i34.40125