Zhao, Yangyang, et al. “Automatic Curriculum Learning With Over-Repetition Penalty for Dialogue Policy Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 16, May 2021, pp. 14540-8, doi:10.1609/aaai.v35i16.17709.