[1] M. Jing, “Reinforcement Learning from Imperfect Demonstrations under Soft Expert Guidance”, AAAI, vol. 34, no. 04, pp. 5109-5116, Apr. 2020.