Cohen, Andrew, Xingye Qiao, Lei Yu, Elliot Way, and Xiangrong Tong. “Diverse Exploration via Conjugate Policies for Policy Gradient Methods”. Proceedings of the AAAI Conference on Artificial Intelligence 33, no. 01 (July 17, 2019): 3404-3411. Accessed April 18, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/4215.