Wu, Yuexin, Xiujun Li, Jingjing Liu, Jianfeng Gao, and Yiming Yang. “Switch-Based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning”. Proceedings of the AAAI Conference on Artificial Intelligence 33, no. 01 (July 17, 2019): 7289-7296. Accessed March 29, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/4715.