[1]
H. Le, M. Abdolshah, T. K. George, K. Do, D. Nguyen, and S. Venkatesh, “Episodic Policy Gradient Training”, AAAI, vol. 36, no. 7, pp. 7317–7325, Jun. 2022.