Morimura, Tetsuro, Takayuki Osogami, and Tomoyuki Shirai. “Mixing-Time Regularized Policy Gradient.” Proceedings of the AAAI Conference on Artificial Intelligence 28, no. 1 (June 21, 2014). Accessed October 12, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/9013.