Zhang, Haodi, Zhichao Zeng, Keting Lu, Kaishun Wu, and Shiqi Zhang. “Efficient Dialog Policy Learning by Reasoning with Contextual Knowledge.” Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 10 (June 28, 2022): 11667–11675. Accessed April 22, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/21421.