Zhu, Z., Lin, K., Dai, B., & Zhou, J. (2022). Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations. Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), 9269–9277. https://doi.org/10.1609/aaai.v36i8.20914