Zhu, Zhuangdi, Kaixiang Lin, Bo Dai, and Jiayu Zhou. 2022. “Self-Adaptive Imitation Learning: Learning Tasks With Delayed Rewards from Sub-Optimal Demonstrations”. Proceedings of the AAAI Conference on Artificial Intelligence 36 (8):9269-77. https://doi.org/10.1609/aaai.v36i8.20914.