[1] K. Ji, Y. Guo, Z. Zhang, X. Zhu, Y. Tian, and N. Liu, “MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine,” AAAI, vol. 40, no. 42, pp. 35536–35544, Mar. 2026.