Lu, K., Zhang, S., & Chen, X. (2019). Goal-Oriented Dialogue Policy Learning from Failures. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 2596–2603. https://doi.org/10.1609/aaai.v33i01.33012596