Wu, Y., Li, X., Liu, J., Gao, J. and Yang, Y. (2019) “Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 7289-7296. doi: 10.1609/aaai.v33i01.33017289.