Jing, Mingxuan, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Chao Yang, Bin Fang, and Huaping Liu. “Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance”. Proceedings of the AAAI Conference on Artificial Intelligence 34, no. 04 (April 3, 2020): 5109-5116. Accessed August 14, 2022. https://ojs.aaai.org/index.php/AAAI/article/view/5953.