Iizuka, S. (2024) “Clarifying the Dialogue-Level Performance of GPT-3.5 and GPT-4 in Task-Oriented and Non-Task-Oriented Dialogue Systems”, Proceedings of the AAAI Symposium Series, 2(1), pp. 182–186. doi: 10.1609/aaaiss.v2i1.27668.