(1) Li, Z.; Kiseleva, J.; de Rijke, M. Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning. AAAI 2019, 33, 6722-6729.