[1]
Z. Guo, B. Xu, C. Zhu, W. Hong, X. Wang, and Z. Mao, “MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools”, AAAI, vol. 40, no. 37, pp. 30888–30896, Mar. 2026.