Cohen, D., & Lane, I. (2016). An Oral Exam for Measuring a Dialog System’s Capabilities. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10060