Zhao, Y., Wang, Z., & Huang, Z. (2021). Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), 14540–14548. https://doi.org/10.1609/aaai.v35i16.17709