Zhu, Zhuangdi, Kaixiang Lin, Bo Dai, and Jiayu Zhou. “Self-Adaptive Imitation Learning: Learning Tasks With Delayed Rewards from Sub-Optimal Demonstrations.” Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 8 (June 28, 2022): 9269-9277. Accessed July 10, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/20914.