1.
Wu Y, Li X, Liu J, Gao J, Yang Y. Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning. AAAI [Internet]. 2019Jul.17 [cited 2024May18];33(01):7289-96. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/4715