[1]
W. Yang, L. Xie, J. Cai, Y. Yan, H.-N. Dai, and H. Wang, “Talk2Code: A Multi-Turn Interaction Benchmark with Dual-Track Evaluation for Code Generation”, AAAI, vol. 40, no. 40, pp. 34331–34339, Mar. 2026.