[1]
W. Zhi, J. Guo, and S. Li, “MedGR2: Breaking the Data Barrier for Medical Reasoning via Generative Reward Learning”, AAAI, vol. 40, no. 34, pp. 28901–28909, Mar. 2026.