1. Guo Z, Xu B, Zhu C, Hong W, Wang X, Mao Z. MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 11];40(37):30888-96. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/40347