[1] A. Cohen, X. Qiao, L. Yu, E. Way, and X. Tong, “Diverse Exploration via Conjugate Policies for Policy Gradient Methods,” AAAI, vol. 33, no. 01, pp. 3404–3411, Jul. 2019.