[1]
Z. Ni, “GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging”, AAAI, vol. 40, no. 38, pp. 32564–32572, Mar. 2026.