Pan, F., Q. Cai, A.-X. Zeng, C.-X. Pan, Q. Da, H. He, Q. He, and P. Tang. “Policy Optimization With Model-Based Explorations”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 4675-82, doi:10.1609/aaai.v33i01.33014675.