Iizuka, S., Mochizuki, S., Ohashi, A., Yamashita, S., Guo, A., & Higashinaka, R. (2024). Clarifying the Dialogue-Level Performance of GPT-3.5 and GPT-4 in Task-Oriented and Non-Task-Oriented Dialogue Systems. Proceedings of the AAAI Symposium Series, 2(1), 182–186. https://doi.org/10.1609/aaaiss.v2i1.27668