Ouyang, S. (2026) “DSCodeBench: A Realistic Benchmark for Data Science Code Generation”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(38), pp. 32628–32636. doi: 10.1609/aaai.v40i38.40540.