Guo, Zikang, Benfeng Xu, Chiwei Zhu, Wentao Hong, Xiaorui Wang, and Zhendong Mao. “MCP-AgentBench: Evaluating Real-World Language Agent Performance With MCP-Mediated Tools”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 37 (March 14, 2026): 30888–30896. Accessed May 11, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/40347.