Cohen, A., Qiao, X., Yu, L., Way, E., & Tong, X. (2019). Diverse Exploration via Conjugate Policies for Policy Gradient Methods. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3404-3411. https://doi.org/10.1609/aaai.v33i01.33013404