Zhang, M. S., Erdogdu, M. A. and Garg, A. (2022) “Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings”, Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), pp. 9066–9073. doi: 10.1609/aaai.v36i8.20891.