[1]
C. Zhang, Y. Li, and J. Li, “Policy Search by Target Distribution Learning for Continuous Control”, AAAI, vol. 34, no. 04, pp. 6770-6777, Apr. 2020.