Cohen, Andrew, Xingye Qiao, Lei Yu, Elliot Way, and Xiangrong Tong. 2019. “Diverse Exploration via Conjugate Policies for Policy Gradient Methods”. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01):3404-11. https://doi.org/10.1609/aaai.v33i01.33013404.