[1]
L. Yang, Q. Zheng, and G. Pan, “Sample Complexity of Policy Gradient Finding Second-Order Stationary Points”, AAAI, vol. 35, no. 12, pp. 10630-10638, May 2021.