[1]

Wu, Y., Li, X., Liu, J., Gao, J. and Yang, Y. 2019. Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. Proceedings of the AAAI Conference on Artificial Intelligence. 33, 01 (Jul. 2019), 7289-7296. DOI:https://doi.org/10.1609/aaai.v33i01.33017289.