Cohen, A., Qiao, X., Yu, L., Way, E. and Tong, X. (2019) “Diverse Exploration via Conjugate Policies for Policy Gradient Methods”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 3404-3411. doi: 10.1609/aaai.v33i01.33013404.