Pan, F., Cai, Q., Zeng, A.-X., Pan, C.-X., Da, Q., He, H., He, Q. and Tang, P. (2019) “Policy Optimization with Model-Based Explorations”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 4675-4682. doi: 10.1609/aaai.v33i01.33014675.