Zhu, Q. (2025) “DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(24), pp. 26148–26156. doi: 10.1609/aaai.v39i24.34811.