(1) Huo, L.; Wang, Z.; Xu, M. Learning Noise-Induced Reward Functions for Surpassing Demonstrations in Imitation Learning. AAAI 2023, 37, 7953-7961.