Le, H., Abdolshah, M., George, T. K., Do, K., Nguyen, D., & Venkatesh, S. (2022). Episodic Policy Gradient Training. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 7317-7325. https://doi.org/10.1609/aaai.v36i7.20694