(1)
Morimura, T.; Osogami, T.; Shirai, T. Mixing-Time Regularized Policy Gradient. AAAI 2014, 28.